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End chronic kidney 
disease neglect 


It is unacceptable that kidney-dialysis 
technology has changed little in the past 
five decades. 


ialysis almost immediately saved lives when 
it was invented in the first half of the twenti- 
eth century to treat kidney disease by safely 
accessing a patient’s blood supply and filtering 
toxins normally removed by the kidneys. 

The original dialysis machine, arudimentary contraption 
invented by physician Willem Kolffin the early 1940s, was 
made fromcellophanetubes and a wooden drum. Although 
today’s machines are manufactured industrially, operating 
technologies have changedlittle since the 1960s. And that, 
as we report in a Feature on page 186, is a problem. 

The World Health Organization estimates that, each year, 
around 1.2 million people worldwide die from kidney fail- 
ure. Thisis partly aresult of the number of people with high 
blood pressure and diabetes, which strain and damage the 
kidneys. But a combination of dialysis technology’s prac- 
tical limitations and affordability also means that fewer 
than half of the people who need treatment can access it. 

In Africa, just 16% of people with kidney disease get dial- 
ysis, and even fewer can sustain the cost of treatment for 
morethana few months. Inthe United States, where dialysis 
can cost up to US$91,000 per patient per year, fewer than 
half ofthose onthe most common form of dialysis survive 
for more than five years from the onset of kidney failure. 

A fundamental problem is that dialysis involves con- 
necting patients to machines that can weigh more than 
100 kilograms. For most people, that necessitates regular 
visits to a hospital or dialysis clinic. A typical treatment 
regime can take 12 hours, spread over 3 weekly sessions 
that see toxins filtered from the blood and levels of vari- 
ous salts and minerals recalibrated. For patients, this is an 
ordeal that is both energy-sapping and time-consuming. 

Dialysis also consumes resources. Between 120 and 
240 litres of filtered water are needed for each 4-hour 
session. By one estimate, the annual requirements of the 
dialysis provided around the world include more than 
156 billion litres of water and roughly 1.62 billion kilowatt 
hours of power — roughly equivalent to the electricity 
needed to power asmall European city fora year. Dialysis 
also generates some 625,000 tonnes of plastic waste. 

Solutions include making dialysis more portable — so 
that it can be carried out at home or on the move — and 
finding ways to doit with less water and power. This would 
be of particular benefit to patients in developing countries. 

Promising technologies are being developed to make 
the machines smaller and more portable. Butit is not clear 
whether these will reach many of those who need them. 
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Despite the clear need for innovation, research funding 
is low. Last year, the US government and the American 
Society of Nephrology announceda plan to raise $250 mil- 
lion over 5 years for research. This is welcome, but so far 
the collaboration, called Kidney X, has disbursed only 
$1.1 million in grants. 

The reality is that chronic kidney disease needs the kind 
of coordinated global effort, involving funders, researchers 
and patient groups, that some other conditions attract. 
Together, these groups must support more research 
into the mechanisms that underlie kidney disease, and 
approaches to prevent its development and progression. 

At the same time, countries could encourage patients’ 
family members to consider kidney donation. On average, 
transplant patients live longer than do those on dialysis — 
reducing the heavy health and economic burdens of this 
neglected killer. 


DARPA ‘lookalikes’ 
must ground their 
dreams in reality 


The US Defense Advanced Research Projects 
Agency knows that its freedom to invent 
comes with responsibility. 


he government of UK Prime Minister Boris John- 

sonis racing ahead with plans for an Advanced 

Research Projects Agency (ARPA), modelled on 

the US original. The country is looking to boost 
technological competitiveness as it withdraws 

fromthe European Union. Precise details ofits ARPA plans 
are yet to be revealed, but the available funding is expected 
tocometo around £800 million (US$1 billion) over 5 years. 
The US Defense Advanced Research Projects Agency 
(DARPA), which supports ambitious technologies for 
military objectives, was launched in 1958 by president 
Dwight Eisenhower. The impetus was the Soviet Union’s 
1957 launch of the first artificial satellite, Sputnik, which 
demonstrated a level of technological prowess that 
shocked Western nations. Eisenhower’s ambition for 
DARPA — established in the same year as NASA — was that 
the US military would never again be left behind in this way. 
DARPA’s best-knowninvestments include research onthe 
first global satellite-navigation system (knownas Transit), 
stealth aircraft and the Internet’s precursor, ARPANET. 
Today, the many projects funded by the agency include 
work on developing treatments to regrow severed limbs. 
DARPA spends about $3.5 billion a year, which is less than 
1% of the total US public and private research and devel- 
opment budget. It’s a small enough proportion to justify 
DARPA’s reputation for taking on riskier ideas and having 
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a higher tolerance for failure than conventional funding 
agencies. One idea that famously didn’t make it was the haf- 
nium bomb, based on the flawed belief that large amounts 
of energy can be released by bombarding the isotope 
hafnium-178 with X-rays. 

Although DARPA’s research programme managers have 
relatively more flexibility in what research to fund and how, 
nocountry has been able to replicate the scale of the agen- 
cy’s success. Even in the United States its achievements 
remain unrivalled. A different advanced research projects 
agency for newenergy technologies (ARPA-E) — launched 
in 2009 — is under constant threat of being eliminated by 
the administration of President Donald Trump. 

One reason why DARPA is so hard to replicate, says 
DARPA historian Sharon Weinberger, is because the agen- 
cy’s projects have a resource that the others lack. “They 
have acustomer with the deepest pockets inthe world,” she 
says. The US Department of Defense’s annual budgets for 
research and procurement, totalling $190 billion, enable 
it to fund successful prototypes ona large scale, to test 
whether they might be commercially viable. 


Risk management 


A closer look at DARPA shows how its managers pursue 
bold ideas while controlling risk. Ina Comment article 
on page 190, members of a team working with — and in — 
the agency’s Biological Technologies Office in Arlington, 
Virginia, report on an initiative launched in 2016. This 
assigns an independent validation team to projects to 
troubleshoot and reproduce research proposals. This 
‘shadow team’ meets with the ‘performing team’ to learn 
the precise protocols and establish the necessary condi- 
tions to reproduce projects, and the two groups make joint 
presentations to the programme manager on progress. 

The work is hard — one project took aslong as 20 months 
toreproduce. It is also expensive: it costs between 3% and 
8% of aprogramme’s funds to make sure the technologies 
work. But programme managers Say it is worth the invest- 
ment, and the model demonstrates a more careful side 
to the agency than DARPA’s daring image tends to evoke. 

These efforts are instructive, both for dreams of a UK 
ARPA and for science overall. Some of UK ARPA’s support- 
ers would like to see cutting-edge technologies devel- 
oped within 15 years — and a certain ruthlessness when it 
comes to axing the least promising ones. But an ambitious 
technology goal in, say, regenerative medicine or remote 
sensing will probably need longer before careful study can 
make the promises — and risks — clear. 

Researchers, their managers in universities, and fund- 
ing agencies all understand why effective due diligenceis 
essential to projects. But it can be difficult for these voices 
to be heard when no less than the prime minister’s office 
celebrates ARPA as “high-risk, high-pay-off research”, and 
characterizes bureaucracy as “form-filling”. 

Any nation looking to replicate DARPA must realize 
that you can’t reap the rewards of high-risk research with- 
out investing in meticulous preparation and verification. 
The freedom to pursue bold ideas comes with added 
responsibility. 
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Alack of locust 
preparedness 


Locusts are causing a food crisis that can no 
longer be ignored. 


hile all eyes are on the coronavirus 
outbreak, an under-reported emergency 
is threatening food, health andjobs on 
three continents. For the past several 
months, swarms of the desert locust 
Schistocerca gregaria — some swarms the size of cities — 
have devoured crops in East Africa, the Middle East and 
south Asia. Some 20 million people are facing a food crisis. 

Governments have been left under-prepared for the 
scale of these attacks, andthe Rome-based United Nations 
Food and Agriculture Organization (FAO) has appealed for 
US$138 million in urgent funding —some of which is needed 
to lease aircraft that can drop chemicals to curb the spread. 

Locusts are an annual fixture after the rainy season — 
laying their eggs in moist soils. But the size of this year’s 
swarms — the biggest for at least 25 years — are duein part 
to unseasonal and often torrential rains in many areas, 
including Eritrea, Ethiopia, Iran, Pakistan, Saudi Arabia 
and Yemen. The FAO’s calls must be heeded — especially 
as more rains are to come, bringing the potential for yet 
more devastation. But at the same time, some of the gov- 
ernments concerned must ask themselves what more they 
could be doing to limit the damage. 

In many countries, meteorological offices share climate 
and weather data with what are called desert-locust-control 
offices. These are set up to forecast locust infestations, and 
to advise on potential crop losses and mitigation measures. 
East Africa has a regional body called the Desert Locust 
Control Organization for East Africa, headquartered in 
Addis Ababa and funded by nine African countries. 

Nature has been told that some members — such as 
Djibouti, Somalia and Sudan — have been unable to pay their 
membership fees for many years and collectively owe the 
organization more than $8 million. Uganda, which partially 
clearedits arrears last month, still owes $2 million. Somalia 
and Sudan have both experienced severe conflict, so it’s 
understandable that locusts have not beena priority. But the 
insects can be just as threatening to well-being, and if indi- 
vidual countries can’t pay their way, then the African Union 
orthe UNneedtostep in. Paying into locust-control offices 
should be regarded as keeping up an insurance policy. 
The hope with insurance is that it’s never needed, but the 
facility must always be there should the need arise. 

The focus now is rightly on emergency food relief. But 
preparations for coming swarms need to be accelerated. 
The African Union and the UN must ensure that countries’ 
desert-locust organizations, informed by the latest 
research, are better equipped to help when thetime comes. 
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HENRIETTA GARDEN 


A personal take onscience and society 


World view 


By Angela Saini 


Want to do better science? 
Admit you're not objective 


When science is viewed in isolation from 
the past and politics, it’s easier for those 
with bad intentions to revive dangerous 
and discredited ideas. 


ne of the world’s leading universities — 

University College London (UCL) — has 

completed an inquiry into its support for 

the discredited pseudoscience of eugenics. 

Funds linked to Francis Galton, a racist who 
believed it was possible to improve the British popula- 
tion through selective breeding, and who founded the 
Eugenics Records Office at UCL in 1904, continue to line 
theuniversity’s coffers to the value of more than £800,000 
(US$1 million). 

The inquiry’s report, released on 28 February, recom- 
mended renaming lecture theatres and buildings bearing 
Galton’s name and that of another prominent geneticist. 
Although this is welcome, it does not acknowledge just 
how much yesterday’s mistakes survive in modernscience. 

As I found while writing my 2019 book Superior: The 
Return of Race Science, geneticists today rightly treat 
eugenics as a laughable proposition, and the concept of 
biological race — the belief that humans can be subdi- 
vided into distinct groups with meaningful differences 
between them — as easily debunked nonsense. But this 
ignores howthese ideas manifest in the real world. They can 
only be truly understood as age-old intellectual threads, 
embedded in politics as much now as ever. 

In failing to recognize that science can be political, the 
scientific community allows the resurrection of dangerous 
ideas. Acting as if theories — especially those about humans 
— exist in cultural or political vacuums is a ridiculous 
fallacy. 

The UCL inquiry was prompted in part by 2018 revela- 
tions that anow-former honorary fellow had been booking 
meeting space for secretive conferences discussing race 
and eugenics. Many people — even members of the inquiry 
committee — are concerned that the investigation did not 
go far enough in connecting the pseudoscience of the past 
with ongoing attempts to keep that pseudoscience alive. 

In the same month that UCL released its report, news 
broke that Dominic Cummings, aself-proclaimed science 
enthusiast and special adviser to the UK prime minister, 
had hired an aide who espouses eugenicist views. Now 
resigned, Andrew Sabisky had suggested compulsory con- 
traception to halt the growth of a “permanent underclass”. 

When a survey conducted as part of the UCL inquiry 
asked staff and students whether “we should separate 
science and politics”, it found agreement among higher 
percentages of those in the sciences and engineering than 
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in the social sciences and history. In my coverage of the 
inquiry, I’ve seen that it was not the university's biologists, 
but its humanities scholars — including curator Subhadra 
Das and historian Joe Cain — who forced their workplaceto 
confront a sordid history that some geneticists had been 
willing to overlook. 

“If the past is to be called upon to legitimize the present, 
asit so frequently is, then the veracity of sucha past has to 
be continuously vetted,” writes Romila Thapar, ahistorian 
at Jawaharlal Nehru University in New Delhi. Those who 
seek to bring back eugenics prefer to gloss over the past 
and treat the concept as a purely scientific proposition. 
This is made easier by scientists who are willing to separate 
science from politics, even when — as with eugenics — it 
was inherently political to begin with. 

The UCL survey also revealed that members of the 
university community who are disabled or from minority 
ethnic backgrounds are more likely to feel that the legacy 
of eugenics is still present than are those who are white and 
not disabled. The people onthe receiving end of the world- 
view that drove eugenics understand howalive it remains. 

Scientists who imagine that bias lies in others, not them- 
selves, fail to recognize that to live in the world today is to 
be drip-fed assumptions and prejudices that guide our 
thoughts and actions. If it were any other way, the demo- 
graphics of academia would be more equitable, and the 
current strain of genetic determinism in governments 
wouldn’t be possible. Racism and prejudice are woven into 
the structures in which we all live and work — andinto us. 

A lack of education means we fail to draw these links. 
At a philosophy festival last September, I spoke about 
non-European cultures and their contributions to science 
and mathematics. One scientist remarked that he had no 
need to know about what had been done in ‘bongo bongo’ 
land. The audience was shocked; | was disappointed. He 
was a product of a system that had not taught him that 
he needed to know better. It is this system we need to fix. 

Scientists rarely interrogate the histories even of their 
own disciplines. When I studied engineering at university, I 
was expected to write just one essay on ethics in four years. 
No wonder that new technologies perpetuate racial and 
gender stereotypes, or that automated facial recognition 
struggles to identify people with darker skin. 

The best research is done not when we pretend that 
we are perfectly objective, but when we acknowledge 
that we are not. The UCL inquiry report recommends that 
students and staff be exposed to the history of eugenics, 
and that students be encouraged to value the history of 
their own fields. I would go further. Scientists need both 
history and the social sciences to develop the intellectual 
tools to think critically about their research and how it 
affects society. This isn’t just helpful — it’s vital. 
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CLIMATE CHANGE CONTRIBUTEDTO 
AUSTRALIA'S EXTREME BUSH-FIRE WEATHER 


Global warming increased 

by at least 30% the risk of the 
weather conditions that drove 
the recent catastrophic bush 
fires in Australia, says a group of 
climate scientists who analysed 
the disaster. 

Australia experiences bush 
fires regularly, but the latest 
event was unprecedented inits 
severity and scale. 

The World Weather 
Attribution (WWA) project 
sought to measure climate 
change’s contribution to the 
bush-fire conditions seenin 
southeast Australia. Fires inthe 
region were particularly severe, 
and killed dozens of people and 
destroyed thousands of homes. 

The group assessed bush- 
fire conditions with an index 
that tracks ‘fire weather’. This 
calculates the chance of fire ina 
location on the basis of variables 
such as temperature, humidity, 
wind and rainfall. The analysis 
did not consider non-weather 
factors, such as howa fire 
started. 

“This is a highly conservative 
assessment,” says David 
Karoly, a climate scientist 
based in Melbourne at the 
Commonwealth Scientific 
and Industrial Research 
Organisation, who was not 


involved in the analysis. 

The project’s researchers 
also say that the result is 
conservative. Models have 
mostly underestimated the 
rise in temperatures that 
has been observed since the 
Industrial Revolution, says 
Geert Jan van Oldenborgh, a 
climate researcher at the Royal 
Netherlands Meteorological 
Institute in De Bilt, the 
Netherlands, and co-author of 
the WWA analysis. 

The team also examined 
whether climate change 
had influenced two of the 
components that are used to 
measure fire weather: extreme 
temperatures and drought. The 
results, which were posted on 
the group’s website and have 
not been peer reviewed, suggest 
that human activity doubled the 
chance of heatwave conditions 
during the fires, but do not show 
that climate change contributed 
to the extremely dry conditions 
that Australia experienced. 

Climate change definitely 
played a part in the fires, says 
Andy Pittman, aclimate scientist 
at the University of New South 
Wales in Sydney, Australia. But 
he questions whether the results 
are meaningful, because models 
struggle to simulate fires. 
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EU CHAMPIONS BOLD 
NEW CLIMATELAW 


The European Commission has 
proposed alaw that would give 
it far-reaching power to dictate 
the course of political action on 
climate change in the European 
Union. 

The draft climate law, 
unveiled in Brussels on 4 March, 
would create a legally binding 
commitment for the EU to 
reduce its greenhouse-gas 
emissions to net zero by 2050 
— which means any emissions 
would have to be offset by 
greenhouse-gas uptake, for 
example by trees or through 
carbon capture and storage 
technologies. 

The proposal would also 
give the commission power 
to set binding short-term 
climate targets that don’t need 
unanimous approval from 
all 27 member states. Policy 
analysts say that some countries 
could strongly oppose these 
measures, so the current draftis 
unlikely to be approved without 
substantial amendments. 

The draft does not mention 
specific EU emissions-reduction 
milestones between now and 
2050. Climate campaigners — 
including Swedish teenager 
Greta Thunberg (pictured, right, 
with commission president 
Ursula von der Leyen) — say the 
law doesn’t go far enough. “We 
don’t just need goals for 2030 or 
2050. We, above all, need them 
for 2020 and every following 
month and year tocome,’a 
group of 34 young activists 
wrote in an open letter to EU 
leaders. 
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CORONAVIRUS: 
CHILDREN AS 
SUSCEPTIBLEAS 
ADULTS 


Children are just as likely as 
adults to become infected with 
the new coronavirus, according 
to a detailed study onthe spread 
of the virus. The analysis, based 
on data from Shenzhen in China, 
provides a partial answer toa 
key question surrounding the 
outbreak: the role of children. 

Previous studies have 
suggested that children are less 
likely than other age groups to 
develop severe symptoms when 
infected by the coronavirus. But 
it was not clear whether this was 
because they weren't getting 
infected or because they were 
fighting off the infection more 
effectively. 

“Kids are just as likely to 
get infected,” says Justin 
Lessler, an infectious-disease 
epidemiologist at Johns Hopkins 
Bloomberg School of Public 
Health in Baltimore, Maryland, 
who co-led the study, which was 
posted to the medRxiv preprint 
server on4 March (Q. Bietal. 
http://doi.org/dpf9; 2020). 

The researchers followed 
391 people diagnosed with 
coronavirus, and 1,286 of their 
close contacts to see whether 
these contacts tested positive 
for the virus even if they didn’t 
show symptoms. Overall, the 
team found that children under 
10 who had potentially been 
exposed to the virus were just 
as likely to become infected as 
other age groups, with between 
7% and 8% of contacts of known 
cases later testing positive. 

The findings could influence 
measures intended to halt the 
spread of the virus. “This is a key 
piece of data that may support 
school closures as an effective 
intervention,” Caitlin Rivers, 
an epidemiologist at Johns 
Hopkins Bloomberg School of 
Public Health, said ina tweet on 
5 March. 
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This thin slice of rock, viewed using a microscope, is 
part of asample taken from the Atlantis Bank at the 
bottom of the Indian Ocean, where researchers have 


found microbes living deep within Earth’s oceanic crust. 


A team led by marine microbiologist Virginia 
Edgcomb at Woods Hole Oceanographic Institution 
in Massachusetts identified a range of bacteria,fungi 
and archaea that inhabit hairline cracks in rocks up to 
750 metres below the ocean floor, where they are able to 
live and grow despite extremely limited resources (J. Li 
etal. Nature 579, 250-255; 2020). By measuring enzyme 
activity and gene expression, the researchers showed 
that these microbial communities have adapted to their 
conditions by maintaining low levels of cellular activity 
and feeding on carbon from fragments of amino acids 
and other organic molecules carried by deep ocean 
currents. 


TINY DINOSAUR 
PRESERVED IN AMBER 
FOR 100 MILLION 
YEARS 


A creature exquisitely preserved 
in amber for 100 million years is 
the smallest known dinosaur of 
its era. 

The animal’s skull, described 
this week in Nature, is less 
than 2 centimetre long — and 
suggests the dinosaur was about 
the size of a bee hummingbird 
(Mellisuga helenae), the 
smallest living bird (L. Xing 
et al. Nature 579, 245-249; 
2020). Researchers put the 
new dinosaur in a genus they 
called Oculudentavis, meaning 
‘eye-teeth bird’. “It reveals 
to usa whole new lineage of 
birds,” says Jingmai O’Connor,a 
palaeontologist at the Chinese 
Academy of Sciences Institute 
of Vertebrate Paleontology and 
Paleoanthropology in Beijing, 
who co-led the study. 

The fossil, which was found in 
Myanmar and comes from the 
Mesozoic era, is exceptionally 
well preserved for aspecimen 
of its size. Its beak is crammed 
with dozens of sharp teeth, 
suggesting that in life, the 
creature preyed on insects 
and other small invertebrates. 
Its eyes protrude from either 
side of its skull so, unlike most 
modern predators, this dinosaur 
did not have binocular vision. 
And its size and age mean 
that miniaturization in birds 
occurred earlier than scientists 
previously thought. 
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Medics tend to a patient at the Life Care Center of Kirkland, a facility linked to several of Washington state's confirmed coronavirus cases. 


THE RACE TO UNRAVEL THE 
BIGGEST CORONAVIRUS 
OUTBREAK IN THE UNITED STATES 


As cases in Washington state soar, virologists are working around the clock 
to diagnose cases, reveal routes of transmission and test treatments. 


By Amy Maxmen 
in Seattle, Washington 


ohit Shankar left the virology 
laboratory at 2 a.m. one night last 

week, and was back at the lab bench 

by 7 a.m. the same day. “It’s okay,” he 

says, “I hada doughnut and a coffee.” 
Shankar, a medical scientist, and his col- 
leagues at the University of Washington in Seat- 
tle are poised to trigger an exponential rise in 
the number of cases of the coronavirus disease 
COVID-19 confirmed in and around the city, 
in western Washington state. That’s because 


last week, they began analysing a mountain of 
nose and throat swabs collected from hospi- 
tals. Already, the researchers are seeing clear 
signs that the virus has infected vastly more 
people than have been formally detected. 
Washington state has become the United 
States’ ground zero for COVID-19, which has 
now spread to more than100 countries. Wash- 
ington has declared a state emergency, and ten 
people there have died from the disease. But 
thenumber of confirmed cases in Washington 
— around 160 as Nature went to press — is an 
underestimate resulting froma lack of testing, 
researchers agree. A genomic analysis posted 
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online on 29 February suggested that hun- 
dreds of people in western Washington might 
be infected (see go.nature.com/3clwou3). Aca- 
demic scientists have mostly been prevented 
from measuring the extent of the US outbreak 
because of federal restrictions onthe number 
of labs qualified to run diagnostic tests. But 
that is changing. 

Dozens of virologists and genomicists have 
now kicked into high gear in Seattle, dropping 
or adapting projects to devote resources to 
the outbreak. Researchers are working around 
the clock to find out how many people have 
the disease in the area. Others are analysing 
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genomes to reveal how the virus is transmit- 
ted, or developing new therapies. The scien- 
tists are racing to help Washington avoid the 
fate of Hubei province in China, where more 
than 3,000 people have died of COVID-19 so 
far. Thecoronavirus emerged in the province’s 
city of Wuhan in December, and the initial 
response from officials was slow. 

“Weare past the point of containment,” says 
Helen Chu, aninfectious-disease specialist at 
the University of Washington School of Med- 
icine (UW Medicine) in Seattle. “So now we 
need to keep the people who are vulnerable 
from getting sick.” 


Working in shifts 


“| remembered to eat around 10 p.m. last 
night,” says Keith Jerome, director of the clin- 
ical virology department at UW Medicine, 
nodding to an empty pizza box on a confer- 
ence-room table. InJanuary, his group quickly 
adapted a PCR test described by the World 
Health Organization that identifies snippets 
of the virus’s genome sequence. 

But when COVID-19 reached the United 
States, his team couldn’t check the accuracy 
of its test — done by analysing samples from 
people known to have the disease — because 
of regulations set by the US Food and Drug 
Administration (FDA) to ensure that tests 
are accurate before clinics rely on them. But 
researchers became impatient with the pace 
of FDA processes as the outbreak hit home. 
On 29 February, the agency announced that 
it would allow certain academic labs to test 
people forthe disease — opening Jerome’s lab 
toa flood of samples. 

The lab is now helping hospitals and the 
state to clinically diagnose cases of COVID-19. 
Jerome estimates that with its high-through- 
put molecular-biology equipment, the group 
can screen around 1,000 samples per day — 
roughly 5 times as many as the state depart- 
ment of health’s lab. “That’s why it’s such a 
problem to not have labs like this involved in 
the early days of an outbreak,” Jerome says. 
Now thatthey’re rolling, the researchers have 
plans to bulk up their capacity to 4,000 sam- 
ples daily. Jerome says they’re quadrupling 
their equipment thanks to university funds, 
asking other labs in the city to lendthem PCR 
machines, and adding more researchers to 
their ranks. “We’re making plans for different 
shifts around the clock so that we don’t burn 
people out,” he says. 

Because the test is still new, the lab sends 
positive samples to the state lab for confirma- 
tion. Then the state alerts the US Centers for 
Disease Control and Prevention (CDC), which 
is responsible for official US case counts. 


Undetected spread 


Meanwhile, in Chu’s lab, boxes containing swab 
kits balance precariously on lab benches and 
carts. Her group has just received the green 
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light from the FDA and the CDC to analyse 
nose swabs for the coronavirus. It’s for a sci- 
entific study, rather thanas part of the state’s 
public-health response. Still, the researchers 
act on the results in real time. If samples test 
positive, they notify the health department. 
The project stems fromthe multi-institution 
Seattle Flu Study, which Chu has co-led since 
2018. In that study, participants who feel as 
if they have a cold or influenza swab their 
nose and send the sample to the lab, where 


“Our emergency rooms 
are being flooded today 
by people saying they 
wantatest.” 


researchers sequence any influenza-virus 
genomes they contain. Analyses of these 
genomes reveal the trail that the flutakes as it 
passes around households, homeless shelters, 
offices and communities in the city. 

As COVID-19 overwhelmed Chinain January, 
the Seattle Flu Study researchers felt sure that 
the epidemic would soon spreadtothe United 
States — and that they should track that, too. 
So they customized a genetic test, similar in 
design to what they had used for flu viruses, 
so that it could identify genetic sequences of 
the new coronavirus. On 27 February, soon 
after they had received clearance to use it, 
their ‘COVID panel’ detected the virus ina 
swab from a teenager north of Seattle. His 
case would prove pivotal because he hadn’t 
travelled internationally. The team alerted 
the county health department, and set about 
sequencing the whole genome of the virus. 

After barely 24 hours, the researchers 
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Helen Chu is studying coronavirus spread. 
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had sequenced the genome. They posted 
the sequence to an online platform called 
GISAID. Then a collaborator at Nextstrain, 
an online project that visualizes the spread 
of viruses through genomic analyses, com- 
pared the genome with dozens of others that 
had been sequenced around the world. Ina 
series of tweets on 29 February, Nextstrain’s 
co-founder, computational biologist Trevor 
Bedford, explained his findings. The sequence 
contained an unusual genetic variation that 
matched that of the virus from the first per- 
son reported to have COVID-19 in the United 
States, on 20 January — amantreated ata hos- 
pital north of Seattle. This meant that the virus 
had probably been circulating around western 
Washington for six weeks. Bedford calculated 
that in that time frame, up to 1,500 people 
could have been infected. 

In the days that followed, officials in Seat- 
tle reported a rising number of deaths from 
COVID-19 among older people who hadn't trav- 
elled. Yet what is lacking, says Cassie Sauer, 
chief executive of Washington State Hospi- 
tal Association, are diagnostic tests. “Our 
emergency rooms are being flooded today 
by people saying they want a test,” she says. 
Even with the added power at the University 
of Washington, supplies are limited. 


‘Spending money like it’s nothing’ 
Some of this burden could soon be relieved as 
the Seattle Flu Study changes course. People 
who feel ill but aren’t in dire need of hospital 
care can now order a swab kit from the study 
website. “We’re ramping up to screening 
570 tests a day, and hope to do 1,000 per day 
ina few weeks,” says Lea Starita, agenomicist 
on the project at UW Medicine. She’s scram- 
bling to find researchers to join the lab and 
is purchasing new tools. “I’ve been spending 
money this week like it’s nothing,” she says, 
listing US$390,000 in equipment and reagents 
off the top of her head. 

Asthe swab kits pile up in Chu’s lab, she men- 
tions more than 2,500 samples collected this 
year for the flu study. Some might contain the 
new coronavirus, and genomic analyses could 
reveal how it circulated undetected around 
Seattle. But with the ever-mounting workload, 
there’s notime to analyse them now. Chuis also 
involved in an effort to isolate antibody pro- 
teins from people with COVID-19, inthe hope 
that researchers can developa treatment. And 
she’s leading a clinical trial at UW Medicine to 
see whether the experimental antiviral drug 
remdesivir could be used to treat the disease. 

Chu doesn’t spare a thought for the work 
she’s put on hold. She’s in triage mode, prior- 
itizing the most urgent questions. “As diagnos- 
tic testing ramps up, it will become clear that 
this is everywhere,” she says. Seattle’s scien- 
tists might help Washington to mitigate the 
outbreak’s harm to lives and the economy, and 
provide models for other states and countries. 
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MONKEYS AND MICE 
ENLISTED 10 FIGHT 


CORONAVIRUS 


Animal models can reveal how infections 
develop, and aid drug and vaccine efforts. 


By Ewen Callaway 


ith no sign that the coronavirus 

is going away, researchers are 

looking to animals to understand 

COVID-19. They are testing mon- 

keys, mice and even ferrets to 
answer key questions about the disease and 
to fast-track potential drugs and vaccines for 
clinical trials. 

Teams in China have reported initial findings 
from studies in which they infected monkeys 
and mice engineered to be susceptible to 
infection by the coronavirus, called SARS- 
CoV-2. And a team at the Australian Animal 
Health Laboratory in Geelong is studying the 
infection in ferrets, before testing potential 
vaccines. Ferrets are a popular model for 
respiratory infections because their lung 
physiology is similar to humans’. 

But no animal model is perfect. “There’s 
going to be a need not just for one animal 
model, but multiple,” says David O’Connor, 
a virologist at the University of Wisconsin- 
Madison. 


Mildillness 


O’Connor and fellow University of Wisconsin 
virologist Thomas Friedrich are part of a net- 
work of about 60 scientists who are sharing 
details of their efforts to study theinfectionin 
primates and other animals. They were excited 
to read about experiments in non-human pri- 
mates infected with COVID-19, reported ina 
preprint on 27 February (C. Shan etal. Preprint 
at Research Square https://doi.org/10.21203/ 
rs.2.25200; 2020). 

That research, led by virologist Chao Shan 
at the Chinese Academy of Sciences Wuhan 
Institute of Virology, found that rhesus 
macaques infected with the coronavirus had 
a fairly mild illness. None developed fevers, 
but X-rays of their lungs showed signs of 
pneumonia similar to those in humans with 
COVID-19. The researchers also monitored 
two animals for three weeks; these monkeys 
lost weight, but didn’t have other serious 
symptoms. 

The fact that the monkeys seemed to 
develop symptoms similar to those in people 
with mild forms of COVID-19 is an important 
takeaway, says O’Connor. But to find models 
for severe human infections, researchers will 


Rhesus macaques are used to study the virus. 


have to look at different animals and vary 
experimental factors such as the route by 
which the virus is administered, he adds. 

Many researchers are turning to lab mice to 
test drugs and vaccines, andto investigate the 
nature of the infection. But ordinary miceseem 
to be resistant to infection by SARS-CoV-2. So 
researchers are hoping to use mice bred to 
produce a human version of the protein ACE2, 
which the virus uses to enter cells. 

One lab has already begun infecting them 
with coronavirus. A team of researchers in 
China found that, like rhesus monkeys, the 
mice seemed to develop only mild illness, 
showing weight loss and signs of pneumonia 
but nothing more severe (L. Baoetal. Preprint 
at bioRxiv http://doi.org/dph2; 2020). Qin 
Chuan, a virologist at Peking Union Medical 
College in Beijing who co-led the study, says 
that, in unpublished work, his team also 
identified several drugs that slowed the 
virus’s replication and limited the animals’ 
weight loss. 

Animals that develop mild infections could 
be useful for testing drugs and vaccines, says 
Stanley Perlman, a coronavirologist at the 
University of lowa in lowa City whose lab 
developed an ACE2 mouse strain. But he’s 
thinking about developing other mouse 
models to better mimic severe cases. “Models 
are imperfect; we do the best we can,” 
Perlman says. 


© 2020 Springer Nature Limited. All rights reserved. 


Why does the 
coronavirus 
spread so easily? 


A microscopic feature could make the 
virus more infectious than the SARS virus. 


As the number of coronavirus infections 
passes 100,000 worldwide, researchers are 
racing to understand what makes it spread 
so easily. 

A handful of genetic and structural 
analyses have identified a key feature 
of the virus — a protein on its surface — 
that might explain why it infects human 
cells much more readily than does the 
coronavirus that causes severe acute 
respiratory syndrome, or SARS. 

Other groups are investigating 
the doorway through which the new 
coronavirus enters human tissues — a 
receptor on cell membranes. The cell 
receptor and the virus protein offer potential 
targets for drugs to block the pathogen, 
but researchers say it is too early to be sure. 

To infect a cell, coronaviruses use a ‘spike’ 
protein that binds to the cell membrane. 
Genomic analyses of the new coronavirus 
have revealed that its spike protein differs 
from those of close relatives, and suggest 
that the protein has a site that is activated 
by an enzyme called furin. 

This is significant because furin is found 
in many human tissues, including the 
lungs, liver and small intestine, which 
means that the virus has the potential 
to attack multiple organs, says Li Hua, a 
structural biologist at Huazhong University 
of Science and Technology in Wuhan, 
China, the city where the outbreak began. 

The finding could explain some of the 
symptoms observed in people with the 
coronavirus, such as liver failure, says Li, 
who co-authored a genetic analysis of the 
virus (H. Li et al. Preprint at http://chinaxiv. 
org/abs/202002.00062; 2020). 

Other groups have also identified the 
activation site as possibly enabling the 
virus to spread easily between humans. 

But some researchers are cautious about 
overstating the role of the site. “We don't 
know if this is going to be a big deal or not,” 
says Jason McLellan, a structural biologist 
at the University of Texas at Austin, who 
co-authored another structural analysis 
(D. Wrapp et al. Science http://doi.org/ 
ggmtk2; 2020). 

By Smriti Mallapaty 
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News in focus 


Litigant Nathan Baring speaks during a meeting of the US Senate Climate Change Task Force. 
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CLIMATE LAWSUITS 
BREAK NEW GROUND T0 
PROTECT THE PLANET 


Despite recent defeats, activists are optimistic that 
courts will provide relief from climate change. 


By Giuliana Viglione 


wenty-year-old Nathan Baring is a 
third-generation Alaskan. Within his 
lifetime, he has seen winters shorten, 
cod fisheries collapse and cultural tra- 
ditions suffer. He grieves for an Arctic 
that is disappearing before his eyes. “There 
is avery distinct loss of place here,” he says. 

Baring decided to take action. He is one of 
21 young plaintiffs who, in 2015, filed alawsuit 
seeking to force the US government to reduce 
greenhouse-gas emissions. A federal appeals 
court dismissed the case, known as Juliana 
v. United States, in January. 

But other attempts to fight climate change 
inthe courts have been more successful. Two 
weeks ago, the UK Court of Appeal blocked 
plans to builda third runway at Heathrow Air- 
port, saying that allowing the expansion would 
violate the country’s obligations to the Paris 
climate agreement. 

Such decisions are inspiring and instruct- 
ing activists and municipalities around the 
world. But as litigants fight scores of such 
cases, one thing is clear. “Thereis no silver litig- 
ation bullet for climate change,” says Michael 
Gerrard, director of the Sabin Center for 


184 | Nature | Vol579 | 12 March 2020 


Climate Change Law at Columbia University 
in New York City. 

On18 February, the International Bar Asso- 
ciation released a model for how to litigate cli- 
mate change, laying out legal arguments and 
precedents that might help future plaintiffs. 

What is likely to succeed seems to vary 
around the world. Andas plaintiffs learn from 
their experiencesin the courts, they are adjust- 
ing their tactics. The Heathrowcase is the first 


major ruling based onthe Paris agreement and 
could spur more suits that rely on those obli- 
gations. In other parts of the world, plaintiffs 
are increasingly focusing on seeking damages 
from polluters themselves. 

The Juliana case is one of more than two 
dozen brought around the world onthe basis 
of ‘public trust’ arguments, which say that the 
state has a duty to protect public resources 
from harm. Such arguments are closely linked 
to the idea that the fundamental right to life 
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is inextricably tied to a healthy environment. 

Although the US court found that the plain- 
tiffs had been harmed by the government’s 
inaction on climate change, the judges ulti- 
mately ruled that it was not within the court’s 
power to legislate climate policy. 

But outside the United States, the human- 
rights approach is the legal strategy that has 
hadthe most success in forcing governments’ 
hands, says John Knox, an expert on inter- 
national environmental law at Wake Forest 
University in Winston-Salem, North Carolina. 

In Urgenda Foundation v. State of the Neth- 
erlands, acase brought in 2015 by an environ- 
mental group and nearly 900 Dutch citizens, 
the Dutch supreme court mandated that the 
government achieve a 25% reduction in green- 
house-gas emissions from 1990 levels by the 
end of 2020 to protect its citizens from the 
harms ofa warming climate system. 

And in Demanda Generaciones Futuras 
v. Minambiente, Colombia’s Supreme Court 
ordered the government to implement pro- 
tective measures to halt deforestation in the 
Amazon — that case was brought by 25 young 
Colombians. Like Juliana, both cases rested on 
the idea that the right to life is endangered by 
threats to the environment. 

Their success suggests that we can expect to 
see more suits brought by citizens against their 
governments, says Knox. Meanwhile, climate 
activists are watching closely tosee howthese 
governments comply with the court-ordered 
actions. 


Damage limitation 


In the United States, the scope of Juliana — 
which sought to force the US government 
to not only stop permitting and subsidizing 
fossil-fuel use, but also implement a plan for 
reducing atmospheric carbon dioxide levels — 
meant it was “always going to be a long shot”, 
says Ann Carlson, who studies environmental 
law at the University of California, Los Ange- 
les. And given the increasingly conservative 
make-up of many US federal courts, Carlson 
thinks it is unlikely that future cases based on 
similar arguments will succeed. 

Instead, Gerrard and Carlson expect more 
US activists and municipalities to pivot from 
targeting the government to suing the pro- 
ducers of emissions themselves. This strategy 
is seen as more pragmatic because cash pen- 
alties can, insome cases, be used for climate 
mitigation. 

In County of San Mateo v. Chevron Corp., sev- 
eral Californian cities and counties are seeking 
funds from major fossil-fuel corporations to 
fund infrastructure for adaptation to sea-level 
rise. Oral arguments in the latest appeal were 
heard on 5 February, but a ruling has yet to 
be handed down. Several other individuals 
and localities, including the state of Massa- 
chusetts, are currently suing ExxonMobil 
and other companies for allegedly deceiving 
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consumers about the risks of fossil-fuel use. 

Similar arguments for compensation are 
also being made outside the United States. In 
November 2015, a Peruvian farmer named Satl 
Lliuya brought a case in German courts against 
the German utilities company RWE, the largest 
emitter of CO, in the European Union. 

Lliuya, who lives near a glacial lake, alleges 
that RWE’s emissions are partially responsible 
for the dangerously high water levels seen at 
the lake as nearby glaciers have melted. He is 
seeking 0.47% of the costs of flood-protection 
measures for his town, equal to RWE’s propor- 
tion of global CO, emissions from 1751 to 2010. 

The case was initially dismissed, but an 


appeals court has since ruled that Lliuya’s 
complaint was admissible, and the court has 
ordered the parties to submit expert evidence 
— the first time sucha case has moved to the 
evidentiary stage. Gerrard says a victory could 
spur similar lawsuits around the world. 
Although the recent ruling inJuliana was 
disappointing, the plaintiffs say that they are 
heartened by the court’s finding that they had 
been harmed by the government’s inaction 
on climate change. They are now preparing 
to appeal, and are optimistic that they will get 
achance to argue their case in front of ajury. 
“We have many paths forward,” Baring says. 
“Thisis certainly not the end of theroad for us.” 


CRISPR TREATMENT 
INSERTED DIRECTLY INTO 
BODY FOR THE FIRST TIME 


Experiment tests a gene-editing therapy 
for ahereditary blindness disorder. 


By Heidi Ledford 


person with a genetic condition that 
causes blindness has become the 
first to receive a CRISPR-Cas9 gene 
therapy administered directly into 
their body. 

The treatment is part of a landmark 
clinical trial to test the ability of CRISPR- 
Cas9 gene-editing techniques to remove 
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mutations that cause a rare condition called 
Leber’s congenital amaurosis 10 (LCA10). 
No treatment is currently available for the 
disease, whichis a leading cause of blindness 
in childhood. 

For the latest trial, the components of 
the gene-editing system - encoded in the 
genome of a virus — are injected directly into 
theeye, near photoreceptor cells. By contrast, 
previous CRISPR-Cas9 clinical trials have used 
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The human retina. A CRISPR therapy has been inserted directly into a person’s eye. 
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the technique to edit the genomes of cells 
that have been removed from the body. The 
material is theninfused back into the patient. 

It’s a significant jump from treating cells 
in a dish, says Fyodor Urnov, who studies 
genome editing at the University of California, 
Berkeley. “It is akin to space flight versus a 
regular plane trip,” he says. “The technical 
challenges, andinherent safety concerns, are 
much greater.” 

“It’s an exciting time,” adds Mark Pennesi, 
a specialist in inherited retinal diseases 
at Oregon Health & Science University in 
Portland. Pennesi is collaborating with the 
pharmaceutical companies Editas Medicine 
of Cambridge, Massachusetts, and Allergan 
of Dublin to conduct the trial, which has been 
named BRILLIANCE. 


Mutation eradication 


This is not the first time gene editing has been 
tried inthe body: an older gene-editing system, 
called zinc-finger nucleases, has already been 
administered directly into people participat- 
ing in clinical trials. Sangamo Therapeutics of 
Brisbane, California, has tested a zinc-finger- 
based treatment for a metabolic condition 
called Hunter’s syndrome. The technique 
inserts a healthy copy of the affected gene 
into aspecific location in the genome of liver 
cells. Although it seems to be safe, early results 
suggest it might dolittle to ease the symptoms 
of Hunter’s syndrome. 

But the BRILLIANCE trial is the first to deploy 
the popular CRISPR-Cas9 technique — which 
has been hailed for its versatility and ease of 
design — directly in the body. In BRILLIANCE, 
gene editing is used to delete a mutation in 
the gene CFP290 thatis responsible for LCA10. 

The condition is a particularly attractive 
target for a gene-editing approach. Conven- 
tional gene therapies use a virus to insert a 
healthy copy of the mutated gene into affected 
cells. But CEP290is too large to slip the entire 
gene into aviral genome, says Artur Cideciyan, 
who studies retinal diseases at the University 
of Pennsylvania in Philadelphia. 

Andalthough mutations in CEP290 disable 
light-sensing cells called photoreceptors in 
the retina, the cells are still present and alive 
in people with LCA10. “The hope is that you 
can reactivate those cells,” says Pennesi. “This 
is one of the few diseases where we think 
you could actually get an improvement in 
vision.” 

Early results from another therapy suggest 
that this might be the case. Cideciyan 
has teamed up with ProQR of Leiden, the 
Netherlands, to treat people withLCA10 using 
an experimental treatment called sepofarsen. 
Early results suggest that sepofarsen, which 
uses a technique called antisense therapy to 
correct an LCA10-causing mutation in RNA 
made from the CEP290 gene, can improve 
vision in people with LCA10. 
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Feature 


TURBOCHARGIN 


rom the start, Mat Risher swore 
that dialysis wouldn't upend his life. 
He had been working at a software 
company, conducting research on 
a car-racing simulator, when kidney 
damage from lupus forced him to 
start the blood-filtering treatments 
three times a week. 

Five years have slipped by, and the sessions 
have sapped his resolve. The 33-year-old now 
works part-time. On good days, he enjoys trying 
outnewrecipes. On bad days, his lupus flares up 
and the strain of incessant dialysis leaves him 
drained. “The times in between [dialysis days], 
Ihave no social life, no dating life,” says Risher, 
who lives just outside Seattle, Washington. “I 
have become arecluse in my room.” 

Risher is relatively fortunate; he has access 
to treatment, whereas up to seven million 
people could die each year without getting 
such care’. But Risher, amember of a patient 
advisory board at Seattle’s Center for Dialysis 
Innovation (CDI), is impatient for a more live- 
able option than dialysis — which has remained 
largely the same for 50 years. 

Walk into any facility, says CDI co-director 
Buddy Ratner, and you'll find a big machine at 
the bedside of every person undergoing dialy- 
sis. “These days it’s going to have LCD screens 
and modern controls,” he says. “But look at the 
pictures in the 1960s of those machines. They 
look rather similar to what we're doing today.” 
Survival has increased, but still, just 42% of US 
patients receiving the most common form of 
treatment, known as haemodialysis, live even 
for five years — shorter than for many cancers. 

Ratner is among an international cadre of 
physicians, bioengineers and entrepreneurs 
whoareworking to revolutionize treatment for 
kidney failure, designing devices that are port- 
able enoughto carry into work or strap around 
the waist. Some are even developing artificial 
kidneys that could be surgically implanted. 

The complexities remain daunting. Dialysis 
poorly mimics the sophistication of the human 
kidney, and improved and more portable ver- 
sions will need miniaturized components and 
a substantial reduction inthe amount of water 
required. Any approaches that make use of 
biological materials will face steep regulatory 
hurdles, too. 

But a new wave of funding is helping to 
reverse the years of stagnation. Last year, US 
President Donald Trump issued an executive 
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order on kidney health, including strategies 
to reduce the shortage of kidneys available 
for transplantation, encourage more dialysis 
at home and incentivize researchinto artificial 
kidneys through a partnership called KidneyX. 
The partnership is led by the US government 
and the American Society of Nephrology 
and plans to raise US$250 million over the 
next five years. Last year, it awarded a total 
of $1.1 million to 15 US-based research teams 
tackling various pieces of the dialysis puzzle, 
including groups pursuing wearable dialysis 
devices and bioengineered kidney grafts. 

Around the world, clinical trials of porta- 
ble devices are advancing, and researchers 
are finalizing a low-tech approach that they 
hope will reach regions of the world where 
clean water is unreliable and dialysis is scarce. 

All these efforts are a drop in the ocean 
compared with the hefty bill to treat people 
living with end-stage kidney disease — at least 
US$35 billion annually in the United States 
alone. But the field is bullish. John Sedor, a 
nephrologist at the Cleveland Clinic in Ohio, 
who chairs the KidneyX steering committee, 
predicts thatamuch more portable device will 
be available inthe next five years, and the first 
wearable device in the next decade. “I think 
this isa remarkable time and we're ata tipping 
point in our field,” he says. 

That innovation is long overdue, says 
Valerie Luyckx, anephrologist at Graubiinden 
Cantonal Hospital in Switzerland who 
researches the global burden of kidney dis- 
ease. Dialysis “is a multibillion-dollar industry, 
with multiple billions in profits since the early 
1960s”, she says. “And nobody has bothered to 
try toinnovate until all of the sudden there is 
research and grant funding for it.” 


Asmart organ 


The kidneys are complex and resilient organs, 
each roughly thesize of a fist. They filter some 
140 litres of blood each day, leaving behind 
a litre or two of water and waste in the form 
of urine. 

Each kidney features a latticework of roughly 
one million tiny filtering units, called nephrons. 
Blood entering anephron passes through aclus- 
ter of tiny vessels called the glomerulus. The 
thin walls of theglomerulus enable waste, water 
and other small molecules to pass through, 
while blocking larger ones such as proteins 
and blood cells. From there, the filtered fluid 
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flows into kidney tubules, where the balance of 
minerals, water, salts and glucoseis calibrated 
and molecules necessary for bodily functions 
are reabsorbed into the bloodstream. 

But many medical conditions can strain 
the kidneys, including diabetes, obesity and 
high blood pressure. And those conditions are 
becoming more common. By 2030, it’s pro- 
jected that 5.4 million people worldwide will 
be getting dialysis or a transplant, and many 
more will die without’. 

For haemodialysis, patients usually needto 
travel toa clinic, where they are connected to 
a machine weighing more than 100 kilograms 
that filters the patient’s blood through a 
semi-permeable membrane, designed to rep- 
licate the function of the glomerulus. Then 
a water-based dialysis solution is used to 
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IG DIALYSIS 


After decades of slow 
progress, researchers 
are Starting to test better 
treatments for people 
with failing kidneys. 

By Charlotte Huff 


rebalance the blood’s components and carry 
the toxins down the drain. Haemodialysis is 
much better at replicating the filtering than at 
achieving the subtle recalibration provided by 
thekidney tubules, says Jonathan Himmelfarb, 
the other co-director of the CDI. Healthy kid- 
neys make subtle adjustments around the 
clock, whereas patients get just 12 hours of 
dialysis across 3 sessions each week, he says. 
Rebalancing the blood so abruptly can bea 
shock to the body and take hours to recover 
from. This is dubbed dialysis washout. Risher, 
who takes a shuttle to dialysis, often falls 
asleep on the way home. 

The inefficient treatmentis also very costly, 
as much as $91,000 annually per patient in 
the United States. And as well as using alot of 
water, the current approach consumes vast 
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quantities of power and materials such as plas- 
tics. “We use massive amounts of water — it’s 
nota green therapy,’ Sedor says. 

Plus, access is patchy around the globe. 
No more than one-third of people in Asia 
get dialysis, and even fewer in Africa’ (see ‘A 
neglected need’). And even when patients in 
sub-Saharan Africa do begin the treatments, 
they are rarely able to sustain them for more 
than a few months’. 

The reason, in large part, is cost. Even 
when the government pays for the sessions, 
the patient’s family often has to foot the bill 
for lab tests, medications and other costs, 
says Gloria Ashuntantang, a nephrologist at 
Yaounde General Hospital in Cameroon. “Most 
of our patients will stop the therapy halfway, 
after having sold all property, andthe children 
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Kidney dialysis is inefficient and exhausting 
for patients such as this person in Yemen. 


having dropped out of school.” 

Still, there has been a lack of drive to 
improve the procedure, in part because the 
treatment has proved highly profitable for 
dialysis providers around the world, says 
Murray Sheldon, a physician and associate 
director fortechnology and innovation at the 
Center for Devices and Radiological Health at 
the US Food and Drug Administration (FDA). 
“They have acashcow. And there’s no need to 
do any innovation.” Dialysis companies chal- 
lenge that argument. Brad Puffer, a US-based 
spokesperson for Fresenius Medical Care in 
Waltham, Massachusetts, says that his com- 
pany is investing in improvements, including 
a haemodialysis device that incorporates a 
material designed to reduce blood clotting, 
a potential side effect that today’s recipients 
have to take medication to prevent. 


Kidney ina backpack 


One of the big problems with modern dialy- 
sis is that the machines require vast amounts 
of water: 120-180 litres for each 4-hour ses- 
sion, Himmelfarb says. “Obviously nobody 
can carry that around them because it would 
weigh tons.” There are a few in-home models 
marketed as portable: Fresenius sells a device 
that it says gives patients more mobility. It 
weighs 34 kilograms and can be used witha 
home tap, as long as the water meets certain 
quality standards. But the first priority in mak- 
ing dialysis more convenient is to remove the 
need for an external water supply. 

In Seattle, CDI researchers have developed 
a technique that pushes the used dialysis 
solution through a cartridge that uses light 
to convert urea — a key toxin targeted by dial- 
ysis — into nitrogen and carbon dioxide, so 
that the solution can be recycled’. The method 
can remove 15 grams of urea in 24 hours, suf- 
ficient for most people with kidney failure, 
and requires only 750 millilitres of solution, 
Himmelfarb says. 

The team’s standalone haemodialysis device 
could be made compact enough to fit inside 
arolling case, Himmelfarb says, weighing no 
more than 9 kilograms. Ideally, patients would 
use it daily, he says. 

Another group trying to downsize dialysis 
was recently formed by the Dutch Kidney 
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Feature 


ANEGLECTED NEED 


Chronic kidney disease kills more people each year than tuberculosis or HIV. Many cases are linked to hypertension 
and diabetes, which can be products of obesity, and a type of kidney inflammation called glomerulonephritis. 
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The number of people receiving treatment for kidney failure in the 
form of either dialysis or a transplant varies widely around the world. 
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Foundation, the medical-devices firm Debi- 
otech in Lausanne, Switzerland, and non- 
profit insurers. Its latest prototype, which it 
hopes to make available to patients by 2023, 
weighs about 10 kilograms and will require 
only 6 litres of solution, according to Ton 
Rabelink, a nephrologist at Leiden University 
Medical Center in the Netherlands who is on 
the medical advisory board of the company, 
called NextKidney. The device, which could 
be used at home, limits the quantity of dial- 
ysis solution needed by using an absorbent 
material to soak up the toxins, Rabelink says. 

In Singapore, researchers at the medi- 
cal-technology company AWAK have been 
testing an even lighter device, one that weighs 
nomorethan3 kilograms. It’s designed for peri- 
toneal dialysis, atechnique that usesa catheter 
tosend dialysis solutionintotheabdominal cav- 
ity, where a lining (the peritoneum) filters out 
toxins from the blood so they can drain, along 
with the solution, into an empty bag. 

The AWAK device relies ona pump andacar- 
tridge to absorb toxins from the used solution 
so that it can be recirculated. Each daily treat- 
ment would last seven to ten hours. 

The company completed asafety trial involv- 
ing 15 adults at Singapore General Hospital in 
2018. It reported no serious adverse events, 
although some patients experienced abdom- 
inal discomfort or bloating. The device is one 
of several more-portable products in devel- 
opment that the FDA has agreed to expedite 
through its ‘breakthrough devices’ programme. 

But testing a device in the controlled setting 
of a hospital is very different from using it in 
daily life, says Arshia Ghaffari, a researcher 
who directs dialysis services at the Univer- 
sity of Southern California in Los Angeles. 
Furthermore, it’s possible that the constant 
recirculation of dialysis solution could strain 
delicate membranes and “burn out the peri- 
toneum faster”, he says. A company spokes- 
man discounted that concern, saying that the 
fluid is recirculated in small increments, just 
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250 millilitres at a time. 

In some regions of the world, peritoneal 
dialysis is not an option, owing to the costs of 
shipping the heavy bags of solution. An inter- 
national competition led by the George Institute 
for Global Health in Camperdown, Australia, in 
2015 sought ways toimprove access. 

The winning technology, developed by Irish 
engineer Vincent Garvey, incorporates alight- 
weight kit that includes sterile bags contain- 


aa 


THAT WOULD BE SO 
MUCH BETTER THAN ANY 
DEVICE, TOHAVE YOUR 
OWN KIDNEY FUNCTION 
REGENERATED. 


ing a dry mix (dextrose and salts), along with 
awater distiller the size of a bread box, which 
sterilizes the water used to make the mix. A 
month’s worth of supplies could be shipped 
ina box weighing 3 kilograms — a big improve- 
ment over atypical day’s supply, which weighs 
8kilograms, says John Knight, managing direc- 
tor of Ellen Medical Devices in Camperdown, 
which was formed to develop the prototype. 
Knight’s goal is to complete a clinical trial by 
the end of next year. 


Recreating the kidney 

Researchers at the University of California, San 
Francisco (UCSF), and Vanderbilt University in 
Nashville, Tennessee, have bypassed external 
devices and instead focused on developing a 
kidney prototype that they hope will one day 
be surgically implanted into a patient’s body. 
It wouldn’t require a pump because it would 
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be attached to key arteries and powered by 
blood pressure, says Vanderbilt nephrologist 
William Fissell, who co-directs the research 
with UCSF’s Shuvo Roy. 

The device contains two key parts: a 
blood-filtration system and a cell-infused 
recalibration module. The filter is made of sil- 
icon membranes with nanometre-scale pores 
that are designed to mimic the glomerulus. 
The recalibration module uses tubule cells 
from discarded human kidneys to rebalance 
the blood’s components, Fissell says. 

Late last year, researchers reported at an 
American Society of Nephrology meeting 
that they had conducted the first safety test 
of the recalibration module in pigs, without 
any of the serious problems often seen with 
implanted devices, including an immune 
reaction or blood clots. 

But Rabelink thinks that implantable devices 
will be more difficult to develop, given that 
they rely ona mix of engineered and biological 
elements, complicating the design and creat- 
ing extra regulatory hurdles. Inthe meantime, 
he posits that advances in stem-cell research 
might surpass such efforts. “In the end, that 
would be so much better than any device, to 
have your own kidney function regenerated 
or prolonged,” he says. 

But Fissell and Roy counter that stem-cell 
techniques have been slow to pay off in other 
areas, such as diabetes treatment, so devices 
such as automated insulin pumps have led the 
way. Fissell describes the project’s primary 
hurdle as securing sufficient funding to man- 
ufacture the device, which is roughly the size 
of a soft-drink can, ona larger, standardized 
scale so it can be evaluated by US regulators. 
“I’ve got the [device] right on my desk — it’s 
ready to sew,’ he says. 

Despite the confidence of some teams, 
Sheldon thinks that recreating the sophisti- 
cation of akidney is too complex for any single 
team, and will probably require a mix of engi- 
neering and biology, plus alot more money. He 
proposed the idea of an international coalition 
at an American Society of Nephrology meeting 
last year, and has planned aseries of meetings 
in Europe later this year with stakeholders and 
medical groups. 

For Risher and other patients, access to any 
portable device would be liberating, provid- 
ing “that freedom and flexibility to do dialysis 
when I want to do it”, he says. As a car aficio- 
nado, he dreams of throwing his machine onto 
the passenger seat and steering for the open 
road, only the horizon before him. 


Charlotte Huff is a science journalist based in 
Fort Worth, Texas. 
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Setting the agenda in research 


Comment 


Acontrolled trial for 


reproducibility 


Marc P. Raphael, Paul E. Sheehan & Gary J. Vora 


For three years, part of DARPA 
has funded two teams for 
each project: one for research 
and one for reproducibility. 
The investment is paying off. 
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n2016, the US Defense Advanced Research 

Projects Agency (DARPA) told eight 

research groups that their proposals had 

made it through the review gauntlet and 

would soon get a few million dollars from 
its Biological Technologies Office (BTO). Along 
with congratulations, the teams received a 
reminder that their award came with an unu- 
sual requirement — an independent shadow 
team of scientists tasked with reproducing 
their results. 

Thus began an intense, multi-year con- 
trolled trial in reproducibility. Each shadow 
team consists of three to five researchers, 
who visit the ‘performer’ team’s laboratory 
and often host visits themselves. Between 3% 
and 8% of the programme’s total funds go to 
this independent validation and verification 
(IV&V) work. But DARPA has the flexibility and 


© 2020 Springer Nature Limited. All rights reserved. 


resources for such herculean efforts to assess 


essential techniques. In one unusual instance, 
an IV&V laboratory needed a sophisticated 
US$200,000 microscopy and microfluidic 
set-up to make an accurate assessment. 

These costs are high, but we think they are 
an essential investment to avoid wasting tax- 
payers’ money and to advance fundamental 
research towards beneficial applications. 
Here, we outline what we've learnt from imple- 
menting this programme, and how it could be 
applied more broadly. 


Engineering lessons 

Engineers expect their work to be subject to 
an IV&V process, in which the organization 
conducting the research uses a separate set 
of engineers to test, for example, whether 
microprocessors or navigation software 
work as expected. NASA’s IV&V facility was 
established more than 25 years ago and has 
around 300 employees testing code and sat- 
ellite components. 

In conventional electronics, IV&V relies on 
fundamental units such as transistors, diodes, 
capacitors and oscillators. The electronics 
industry takes great pains to promote the 
compatibility of these basic elements across 
platforms. For example, a given microproces- 
sor developed by Intel can function in both 


ILLUSTRATION BY DAVID PARKINS 


children’s toys and advanced physics labs. 

That compatibility does not apply to 
biological systems: proteins and cell lines are 
often notinterchangeable, and even identical 
proteins can function differently in different 
environments. First-principle explanations of 
how they work often do not exist. This compli- 
cates efforts to apply IV&V approaches devel- 
oped for electronics and software. What’s 
more, increasingly sophisticated bioengineer- 
ing tools are making cell-biology experiments 
more complicated, so thorough validation 
could take months or even years to complete. 
Few investigators have such resources. 

Instead, the biological sciences have 
depended on other, less-reliable techniques 
for reproducibility. The most long-stand- 
ing is the assumption that reproducibility 
studies will occur organically as different 
researchers work on related problems. In the 
past five years or so, funding agencies and 
journals have implemented more-stringent 
experimental-reporting and data-availability 
requirements for grant proposals and submit- 
ted manuscripts. A handful of initiatives have 
attempted to replicate select published stud- 
ies. The peer-reviewed Journal of Visualized 
Experiments creates videos to disseminate 
details that are hard to convey in conventional 
methods sections. 

Yet pitfalls persist. Scientists might waste 
resources trying to build on unproven tech- 
niques. And real discoveries can be labelled 
irreproducible because too few resources are 
available to conduct a validation. We were 
lucky enough to have the time, money and 
mandate to try something different. 


Making it work 


The synthetic-biology focus of DARPA’s Bio- 
logical Control programme is well suited to 
merging biological research with reproduci- 
bility studies. The programme aims to bring 
engineering principles of design and control 
to biology. By definition, this requires the 
adoption of best practices from the engineer- 
ing community — such as IV&V — to improve 
the likelihood that technologies can advance. 

Awardees were told from the outset that 
they would be paired with an IV&V team con- 
sisting of unbiased, third-party scientists 
hired by and accountable to DARPA. In this 
programme, we relied on US Department of 
Defense laboratories, with specific teams 
selected for their technical competence 
and ability to solve problems creatively. To 
get comfortable with the concept of IV&V, 
investigators needed reassurance that repli- 
cating teams would not steal ideas or derail 


publications. They also needed to get used 
to their results being challenged even before 
peer-review submission, and they needed 
reminders that cooperating with these teams 
was a programme requirement. 

Results so far show a high degree of exper- 
imental reproducibility. The technologies 
investigated include using chemical triggers 
to control how cells migrate’; introducing 
synthetic circuits that control other cell func- 
tions’; intricate protein switches that can be 
programmed to respond to various cellular 
conditions’; and timed bacterial expression 
that works even in the variable environment of 
the mammalian gut*. In the future, we expect 
replication efforts will be reported as sup- 
plemental data submitted with manuscripts. 


“Real discoveries can be 
labelled irreproducible 
because too few resources 
are available to conduct 
avalidation.” 


Especially when claims border on the fantas- 
tical, it is helpful to show peer reviewers and 
editors that an independent party has con- 
firmed the finding. So far, one publication 
co-authored by performer and IV&V teams 
has been accepted’, and two more are near- 
ing submission. Still, getting to this point was 
more difficult than we expected. It demanded 
intense coordination, communication and 
attention to detail. 

Successfully combining reproducibility 
studies with fundamental research required 
a level of coordination between laboratories 
and with the programme manager (P.E.S.) 
that none of us had experienced before. The 
manager worked with each project team to 
determine which of their many results merited 
validation on the basis of the desired impact 
and application. We wanted to know that the 
engineered organism — yeast, bacteria, slime 
moulds, mammalian cells or something else 
— could be modified reliably and that these 
modifications performed as expected, as well 
as what environmental conditions were essen- 
tial for that performance. 

Atypical academic lab trying to reproduce 
another lab’s results would probably limit itself 
toamonthor soand perhaps three or four per- 
mutations before giving up. Our effort needed 
capable research groups that could dedicate 
much more time (in one case, 20 months) and 
that could flexibly follow evolving research. 

Ultimately, the technologies that DARPA 
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is developing should end up being applied 
by many people for a broad range of uses. 
So in addition to assessing whether the tech- 
nologies worked, IV&V teams had to assess 
robustness. For instance, we needed to know 
what fraction of cells would incorporate new 
genetic material, especially when multiple 
genes and control elements were involved. 
We tested whether cells would still work in 
the same way if frozen and thawed months 
later, and whether they would retain their 
functionality after being grown continuously. 
One IV&V team checked whether migrationin 
a genetically modified cell line was faster than 
inits precursor, and fabricated guidancechips 
to determine what surfaces best directed cell 
migration. 

Achieving verification means communicat- 
ing effectively. Performer teams, particularly 
those with several principal investigators, 
had to designate someone to facilitate tele- 
conferences and site visits. Both teams present 
jointly to the programme manager at least 
twice a year. 

A key component of the IV&V teams’ effort 
has been to spend a day or more working with 
the performer teams in their laboratories. 
Often, members of a performer laboratory 
travel to the IV&V laboratory as well. These 
interactions lead to a better grasp of meth- 
odology than reading a paper, frequently 
revealing person-to-person differences that 
can affect results. This is especially true when 
theIV&V investigator does not regularly work 
with the same cell typeas the performer team, 
and thus approaches experiments ina similar 
way to other researchers who are building on 
anewly reported technique. 

Real-time collaboration minimizes or 
avoids logistical roadblocks that are known 
to prevent basic research validation (for 
example, when the original samples cannot be 
located, or the postdoctoral researcher with 
the necessary expertise is no longer with the 
laboratory). Still, our IV&V efforts have been 
derailed for weeks at atime for trivial reasons 
(see ‘Hard lessons’), suchas atypo that meant 
aningredient in cell media was off by an order 
of magnitude. We lost more than a year after 
discovering that commonly used biochemi- 
cals that were thought to be interchangeable 
are not. A five-laboratory consortium testing 
how cultured cells responded to cancer drugs 
reported similar experiences, with minor 
differences causing major effects°. 

Now, our IV&V efforts begin by catalogu- 
ing all chemicals, media and cell types, their 
suppliers and, for animal-derived extracts, 
lot numbers. Instruments are calibrated 
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HARD LESSONS 


Recommendation 


Whatto do 


Our experience 


Document reagents 


Include the vendor, product number and lot number 
for all reagents. 


We lost weeks of work and performed useless experiments when we 
assumed that identically named reagents (for example, polyethylene 
glycol or fetal bovine serum) from different vendors could be used 
interchangeably. 


See it live 


Watch an experiment carried out by another team. In our hands, 


washing cells too vigorously or using the wrong-size pipette tip 
changed results unpredictably. 


Site visits are mandatory because witnessing experiments in action 
reveals valuable information, such as how to trap Hydra without harming 
them, or how to tilt a cell plate. The benefits of site visits in terms of 
achieving reproducibility are worth the cost of plane tickets and lodging. 


State a range 


Rather than a single number, state a range of acceptable 
conditions for temperature, convection and other control 
standards. 


Knowing whether 21°C means 20.5-21.5°C or 20-22°C can tell you 
whether cells will thrive or wither, and whether you'll need to buy an 
incubator to make an experiment work. 


Test, then ship 


Immediately before shipping cells or a genetic construct for 
testing, check them or it. 


Incorrect, outdated or otherwise diminished products were sent to the 
IV&V team for verification many times. 


Double check 


If a standard protocol does not work, the performer and 
independent valuation and verification (IV&V) teams should 
work together on a step-by-step review. 


A typo in one protocol cost us four weeks of failed experiments, and in 
general, vague descriptions of formulation protocols (for example, for 
expressing genes and making proteins without cells) caused months of 
delay and cost thousands of dollars in wasted reagents. 


Pick a person 


Each performer team should designate one person to keep 
communication open, accurate and timely. 


The projects that lacked a dedicated and stable point of contact were the 
same ones that took the longest to reproduce. That is not coincidence. 


Keep in silico 
analysis up to date 


Data-analysis pipelines are replete with configuration decisions, 


assumptions, dependencies and contingencies that move 


Teams had to visit each others’ labs more than once to understand and 
fully implement computational-analysis pipelines for large microscopy 


quickly beyond documentation, making troubleshooting 


incredibly difficult. 


data sets. 


and settings (for example, microscope 
light source power) coordinated to avoid 
unintended side effects such as phototox- 
icity. Routine measurements in cell cul- 
ture, such as pH, osmolarity and testing for 
Mycoplasma, which often fall by the way- 
side, are prioritized. Each project creates a 
customized checklist depending on its cell 
lines, equipment and experiments. With- 
out this essential level of research hygiene, 
troubleshooting efforts would become an 
uninformative time sink. 

We have learnt to note the flow rates used 
when washing cells from culture dishes, to 
optimize salt concentration in each batch of 
medium and to describe temperature and 
other conditions witha range rather thana 
single number. This last practice came about 
after we realized that diminished slime-mould 
viability in our Washington DC facility was due 
to lab temperatures that could fluctuate by 
2°C on warm summer days, versus the more 
tightly controlled temperature of the per- 
former lab in Baltimore 63 kilometres away. 
Such observations can be written up in a pro- 
tocol paper. 

Sometimes, validation requires new equip- 
ment. For the slime moulds, independent val- 
idation meant buying an incubator that could 
keep cells stably at 21.5 °C, slightly belowthe 
IV&V laboratory’s ambient temperature. In 
another case, the performer team had to help 
install customized microfluidic and optical 
equipment at the IV&V lab because the stand- 
ard microscopes and analysis software used 
for live-cell imaging were not up to the task. 

All this makes for a considerably more 
variable IV&V programme than is found in 
microelectronics. But without these efforts, 
some promising technologies could have 
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been abandoned prematurely as seeming 
dead ends. 


Big dividends 

We think that the IV&V programme brings 
benefits beyond reproducing any individual 
project. Now, thereis a process to make investi- 
gations of disparate results more transparent. 
Performing reproducibility studies invariably 
forces scientists to think more deeply about 
their own experimental protocols and tech- 
niques. As one of our scientists said, “IV&V 
forces performers to think more critically 
about what qualifies as a successful system, 
and facilitates candid discussion about system 
performance and limitations.” Trainees told us 
that they have gained skill in analysing data, 
providing constructive criticism and design- 
ing and documenting their own research so 
that it can be reproduced. 

IV&V teams gained further advantages. For 
example, because service laboratories become 
well-versed in the mindset and protocols for 
new technologies even before publications 
appear, they are well-poisedtointegrate them 
into their offerings, predict future directions 
for the field and move research more quickly 
to applications. The IV&V programme also 
expands networking opportunities between 
DARPA scientists and the top-quality labs 
DARPA funds, including the potential to 
recruit postdocs and graduate students across 
laboratories. Not surprisingly, many DARPA 
BTO programmes in recent years have incor- 
porated some form of IV&V to help validate 
programme results. 

As we continue the Biological ControlIV&V 
programme, we expect to find more ways to 
improve it, to better quantify its benefits and 
to codify best practices, such as incorporating 
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automation and robotics where possible 
and keeping an open line of communication 
between performer groups and IV&V teams. 
Althoughsome of the lessons learnt from the 
first stages might seem obvious and trite, that 
also reinforces their necessity. 

We think that a dedicated shift towards the 
IV&V model by more research institutions and 
funding agencies will bring more reliable and 
cost-effective science. Programme officers 
at other granting agencies should consider 
allocating a portion of their funding stream 
to independent reproducibility efforts. This 
will both reduce the number of papers that 
cannot be replicated and improve the qual- 
ity of work that funding agencies support. 
Metrics will need to be established to quan- 
tify the cost savings of applying this model 
to synthetic biology and bioengineering, but 
given its successful integration throughout 
more conventional engineering disciplines, we 
are optimistic that the returns will be worthit. 
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Readers respond 


Correspondence 


COVID-19: don’t 
ignore Taiwan 


As the COVID-19 epidemic 
unfolds, history is repeating 
itself in Taiwan. Still denied 
membership of the World 
Health Organization (WHO) 
and therefore participation 

in international decisions, 
Taiwan could again experience 
a disproportionate number of 
deaths, as it did during the 2003 
SARS epidemic (see Nature 
422, 652; 2003). Taiwan needs 
help and, inturn, the WHO has 
everything to gain from allowing 
it tojoin in the fight against this 
crisis. 

The COVID-19 epidemic calls 
for aresponse consistent with 
the principles of the WHO (see 
go.nature.com/2tbgqrd). In my 
view, Taiwan’s alienation is an 
inexcusable liability for global 
health. Its health-care system 
is ranked first inthe world 
by NUMBEO (see go. nature. 
com/2wbackc). Its researchers 
identified receptor-binding 
proteins of the 2003 SARS virus 
(see go.nature.com/3cqqn82), 
established animal models 
for testing vaccines against it, 
and are now pursuing vaccine 
research and development 
against COVID-19. 

Taiwan is separated from 
mainland China by amere 
strait, across which thousands 
travelled to Taiwan every 
day until Taiwan imposed 
entry restrictions last month 
because of the epidemic. The 
WHO should look again at its 
exclusion of Taiwan. There is no 
place for political disputes when 
millions of lives are at stake. 


Chase W. Nelson Biodiversity 
Research Center, Academia 
Sinica, Taipei, Taiwan. 
cnelson@gate.sinica.edu.tw 


COVID-19: keep up 
with latest papers 


An open-resource literature hub 
known as LitCovid curates the 
most comprehensive collection 
of international research 

papers so far on the new 
coronavirus disease COVID-19 
(see go.nature.com/3almd5p). 
Developed with the support of 
the US National Institutes of 
Health’s intramural research 
programme, LitCovid is updated 
daily with newly published 
articles. The aim is to provide 
timely insight from the scientific 
literature into the biology of 

the virus and the diagnosis and 
management of those who have 
been infected. 

LitCovid has amore 
sophisticated search function 
than existing resources. 
Itidentifies roughly 35% 
more relevant articles than 
do conventional keyword- 
based searches for entries 
such as ‘COVID-19’ or ‘nCOV’. 
Furthermore, the articles 
are categorized by topic — 
overview, disease mechanism, 
transmission dynamics, 
treatment, case report and 
epidemic forecasting — as well 
as by geographic location for 
visualization on a world map. 

We welcome user feedback for 
further enhancement. 


Qingyu Chen, Alexis Allot, 
Zhiyong Lu National Center 
for Biotechnology Information, 
National Library of Medicine, 
National Institutes of Health, 
Bethesda, Maryland, USA. 
zhiyong.lu@nih.gov 


Disaster research: 
for volunteers only 


In our view, a code of conduct for 
research done in disaster zones 
should include a guarantee that 
people in the area can choose 
whether or not to participate 
(see J. C. Gaillard and L. Peek 
Nature 575, 440-442; 2019). Fair 
and voluntary participation isa 
fundamental human right. 

Residents ina disaster area 
fear for their lives in the acute 
phase, and face health risks 
inthe recovery phase. They 
naturally seek help and support 
from their government and 
from professionals. Under such 
circumstances, residents might 
agree to take part in research 
without giving the matter 
enough thought. It is therefore 
important for researchers to 
make clear to them that studies 
could have harmful effects as 
well as benefits. 

As anexample, after the 2011 
accident at the Fukushima 
Daiichi nuclear-power station, 
we undertook screening of 
local children for thyroid 
cancer, which can be induced by 
radioactive iodine (A. Ohtsuru 
et al.JAMA Otolaryngol. Head 
Neck Surg. 145, 4-11; 2019). 
Although participants and their 
parents gave written consent, 
they were unaware of the 
risks of overdiagnosis, which 
include having unnecessary 
surgery to remove small, 
slow-growing tumours (see 
go.nature.com/2vfd9z7). Also, 
screening conducted during 
school time could have given the 
impression that participation 
was mandatory. 

Such ambiguity underlines 
the importance of acode 
of conduct that makes 
participation in disaster-zone 
research voluntary. 


Sanae Midorikawa, Akira Ohtsuru 
Fukushima Medical University, 
Fukushima, Japan. 
otobokesoh@gmail.com 


Disaster research: 
feedback to society 


In our experience, the 
recovery of disaster zones 

can be improved by providing 
research feedback to affected 
communities (seeJ. C. Gaillard 
and L. Peek Nature 575, 
440-442; 2019). 

In 2017, we questioned 
communities that had been 
evacuated following the 
2011 Tohoku earthquake and 
Fukushima nuclear accident 
inJapan. We asked them about 
their awareness of the risks 
of living inthe region and the 
factors that had been important 
for recovery. We then fed our 
results back to them — much to 
their surprise, because we were 
the first researchers ever to have 
done so. These communities 
knew nothing about the 
20,000 publications related to 
the disaster. 

In our view, this remoteness 
of researchers from those 
affected by such catastrophes 
needs to be corrected. It could 
stem from mere oversight 
or from a reluctance to 
sacrifice time that might 
otherwise be spent writing 
papers or grant applications. 
However, meetings with local 
communities before and after 
data collection are ethical and 
productive. Furthermore, 
reporting the details to guide 
the next generation of disaster- 
zone researchers should be 
mandatory. 
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Identifying Earth’s building blocks from terrestrial rocks is 
challenging because these ingredients have become mixed as 
the planet evolved. Evidence of an unknown building block in 
ancient rocks provides fresh insight. See p.240 


Earth formed from an unknown selection 
of meteoritic material. On page 240, 
Fischer-Gédde et al.’ report that the compo- 
sition of ruthenium isotopes in ancient rocks 
from southwest Greenland contains evidence 
ofa previously unrecognized building block of 
Earth. Surprisingly, the inferred isotopic com- 
position of ruthenium in the material does not 
match known meteorite compositions. The 
authors’ findings suggest that Earth’s volatile 
components, suchas water and organic com- 
pounds, could have arrived during the final 
stages of the planet’s growth. 

Our planet is the product of a series of 
collisions of increasingly large celestial 
bodies? *. These building blocks accreted 
froma protoplanetary disk of dust and gas that 
orbited the proto-Sun about 4.6 billion years 
ago. Identifying the compositions of Earth’s 
building blocks is difficult because of our 
limited access to the disk’s remnants, and 
because of the complex, long-term geologi- 
cal processing of the mantle that has mixed 
Earth’s ancient ingredients. 

Potential answers to the question of what 
Earth is made of can come from studies in 
which the isotopic compositions of terres- 
trial rock samples are compared with those 
of meteorites that formed within the first few 
million years of the Solar System’s history. 
These meteorites are presumed to be repre- 
sentative of the smaller bodies that ultimately 
coalesced to form the rocky planets. Conse- 
quently, meteorites are our most promising 
candidates for Earth’s building blocks. 

Fischer-Gédde and colleagues’ study builds 
on the finding that meteorites have charac- 
teristic isotope compositions that serve as 
fingerprints to distinguish different types 
of potential building block. For example, 
meteorites such as carbonaceous chondrites, 


which are often ‘wet’ (that is, they contain 
volatile components), have different isotopic 
fingerprints from meteorites that are generally 
‘dry®. The differences in isotopic composition 
originate from the heterogeneous distribution 
of stardust inthe protoplanetary disk, andare 
knownas nucleosynthetic isotope variations. 
If the fingerprints could be identified in terres- 
trial rock samples, this might provide evidence 
of the material from meteorites that Earth was 
built from. 

The documentation of fingerprints in 
terrestrial rocks could help to constrain esti- 
mates of when volatile elements were delivered 
to Earth and where they came from. This is 
because the abundances of certain isotopes 
of some elements — ruthenium-100 (°°Ru), 
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for example — not only distinguish between 
wet and dry building blocks, but also trace 
different stages of Earth’s accretion history. 

Ruthenium is classified as a highly sidero- 
phile (iron-loving) element, because it collects 
in metal-rich phases of Earth’s interior. Con- 
sequently, most of our planet’s ruthenium is 
concentrated in its metallic core. There are, 
however, traces of ruthenium and other highly 
siderophile elements (HSEs) in the mantle, 
and their relative proportions approximate 
to those measured in primitive meteorites®. 
One interpretation of this is that the HSEs were 
added to the mantle after the core formed, 
during an event called the late veneer — when 
the final approximately 0.5% (of the total per- 
centage weight) of Earth’s mass accreted’®. 
If so, then ruthenium and other HSEs in the 
mantle record the composition of the last 
material that accreted to Earth’. 

It has been proposed that Earth’s volatile 
elements were also added during the late 
veneer, possibly by the accretion of carbo- 
naceous chondrites’°". Studies in the past 
few years, however, have found a mismatch 
between the °Ru-isotope composition (the 
abundances of °°Ru in terrestrial rocks) in 
Earth’s mantle and that in carbonaceous chon- 
drites”?. It could therefore be concluded that 
carbonaceous chondrites did not form part 
of the late veneer, thus casting doubt onthe 
timing of the delivery of volatiles to Earth”. 

This conclusion rests onthe assumption that 
HSEs in the mantle do not contain significant 
quantities of material from before the late 
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Figure 1| A scenario for the preservation of ancient material in Earth’s mantle. a, Between 4.6 billion 

and about 4.5 billion years ago, Earth formed from the accretion of material from meteorites. Siderophile 
elements, which have a strong affinity for metals, segregated into the core. b, The final approximately 0.5% of 
the total percentage weight of Earth’s mass accreted from meteorites during an event called the late veneer, 
after the core had formed. c, Fischer-Gédde et al.' report that ancient rocks from southwest Greenland have 
an unusual ruthenium-isotope composition. They attribute this to the presence of pre-late-veneer mantle 
material in the rocks. The distribution of pre-late-veneer material shown here is speculative; the actual 
amount and distribution cannot be derived from the available data. 
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veneer — a reasonable assertion, given that 
there is limited direct evidence of this. If the 
pre-late-veneer mantle did contain a sub- 
stantial amount of °Ru that did not collect 
inthe core, and that was identifiable by hav- 
ing a different °Ru-isotope composition 
from that of the modern mantle, then car- 
bonaceous chondrites could still have been 
accreted during the late veneer. 

Nucleosynthetic ruthenium-isotope 
variations have not been reported for terres- 
trial rocks before now. This is, in part, because 
Earth has active plate tectonics and mantle 
convection, which mix and dilute the finger- 
prints of its building blocks. However, in the 
past few years, analytical methods" have been 
further developed that enable isotope varia- 
tions to be measured onthe scale of parts per 
million, making it possible to search for these 
primitive isotopic signatures. 

By comparing the °°Ru-isotope composi- 
tions of terrestrial rocks with those of meteor- 
ites, Fischer-Gddde and co-workers report that 
an ancient part of Earth, preserved in rocks 
from southwest Greenland, retains the finger- 
prints of an unusual building block (Fig. 1). The 
fact that the inferred isotope compositions 
donot match known meteorite compositions 
indicates that current meteorite collections 
are considerably limited in their sampling of 
the protoplanetary disk. 

The authors interpret their unusual 
100Ru data as the isotopic signature of pre- 
late-veneer ruthenium inthe source of these 
rocks. Considering their findings in the 
context of the compositions of other HSEs 
in the mantle, the authors suggest that the 
modern composition of the mantle can be 
reconciled with their new data only if the late 
veneer contained carbonaceous chondrites 
to counterbalance the composition of the 
pre-late-veneer component of the mantle. This 
would mean that volatiles could have been 
delivered to Earth during the final stages of 
the planet’s formation. 

Fischer-Godde and colleagues’ data answer 
thelong-standing question of whether Earth’s 
diverse building blocks are preserved and 
accessible for study. But the data also raise 
key questions, the answers to which will 
undoubtedly determine the importance of the 
new findings. For example, how representa- 
tive of the pre-late-veneer mantle is the suite 
of rock samples from southwest Greenland? 
Are nucleosynthetic fingerprints observed in 
the isotopic compositions of other elements 
in the mantle? What is the composition of 
the ‘missing’ meteorites that dominated the 
ruthenium composition of the pre-late-veneer 
mantle, and why has it not yet been identified? 
And how was the isotopic signature of these 
meteorites preserved in the convecting man- 
tle? These questions can be addressed only 
by expanding the search for nucleosynthetic 
fingerprints inthe mantle. 
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Tweaking DNA of myeloid 
cells curbs cancer spread 


Ali Ghasemi & Michele De Palma 


Blood cells called myeloid cells can facilitate metastasis — the 
spread of atumour to distant organs. Taming these cells with 
drugs that alter the chemical structure of their DNA limits 


metastasis in mice. See p.284 


Many tumours are successfully treated with 
surgery. But cancer might recur at the surgical 
site or in distant organs, so surgery is often fol- 
lowed by treatment termed adjuvant therapy. 
This limits the risk of relapse by killing cancer 
cells remaining at the surgical site or those that 
have already moved elsewhere. However, adju- 
vant therapy is not always effective. Moreover, 
it might not prevent certain processes that aid 
cancer resurgence, such as the recruitment 
of blood cells called myeloid cells to distant 
organs, where they can lay the foundation 
for cancer cells to settle and thrive’. Lu et al.” 
reveal on page 284 how the chemical structure 
of the DNA in the nucleus of myeloid cells is a 
vulnerability that can be harnessed to target 
these tumour-promoting cells and limit cancer 
spread. 

Tumour spread from its primary site to 
distant organs, which is called metastasis, 
involves complex interactions between cancer 
cells and the surrounding healthy tissues. Evi- 
dence is growing that primary tumours can 
produce signals that modify normal cells to 
generate a ‘soil’ in distant organs — termed a 
pre-metastatic niche — that permits subse- 
quent ‘seeding’ and establishment of cancer 
cells at this secondary site®. Such secondary 
tumours, or metastases, are often lethal. 

Several types of cancer metastasize from 
their primary site to the lung. Efficient meta- 
stasis of breast cancer cells to the lung in 
mice requires the participation of a type of 
myeloid cell called a monocyte". These cells 
normally function to fight infections, but they 
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can also supply metastasizing cancer cells with 
factors that help them to get established and 
growat asecondary site’*. Certain molecules 
produced by the primary tumour alter the 
properties of monocytes and increase their 
numbers in the bloodstream, fostering the 
monocytes’ tumour-supporting functions in 
the pre-metastatic niche'*. Therefore, block- 
ing such ‘tumour-educated’ monocytes might 
inhibit metastasis. Previous work" indicates 
that neutralizing a protein called CCL2, which 
is produced by the tumour and promotes 
monocyte accumulation in the lungs of mice, 
impairs metastasis in mouse tumour models. 
However, this approach was unsuccessful in 
clinical trials owing to difficulties in effectively 
neutralizing CCL2 (ref. 4). 

Lu et al. studied mice that were given 
transplants under the skin of atype of tumour 
that metastasizes to the lung. Consistent 
with previous studies*”, the authors found 
that two types of myeloid cell — monocytes 
and neutrophils — accumulated in the lung 
before metastases were detectable there. 
These tumour-elicited myeloid cells are col- 
lectively called myeloid-derived suppressor 
cells (MDSCs) owing to their ability to suppress 
the immune response against a tumour’®. Con- 
firming the metastasis-promoting capacity of 
MDSCs, the authors report that elimination 
of MDSCs delayed the metastasis of cancer 
cells to the lung and extended the animals’ 
survival. 

Gene expression can be regulated by 
changes in the nucleus termed epigenetic 
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veneer — a reasonable assertion, given that 
there is limited direct evidence of this. If the 
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convection, which mix and dilute the finger- 
prints of its building blocks. However, in the 
past few years, analytical methods" have been 
further developed that enable isotope varia- 
tions to be measured onthe scale of parts per 
million, making it possible to search for these 
primitive isotopic signatures. 

By comparing the °°Ru-isotope composi- 
tions of terrestrial rocks with those of meteor- 
ites, Fischer-Gddde and co-workers report that 
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new findings. For example, how representa- 
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of rock samples from southwest Greenland? 
Are nucleosynthetic fingerprints observed in 
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Blood cells called myeloid cells can facilitate metastasis — the 
spread of atumour to distant organs. Taming these cells with 
drugs that alter the chemical structure of their DNA limits 


metastasis in mice. See p.284 


Many tumours are successfully treated with 
surgery. But cancer might recur at the surgical 
site or in distant organs, so surgery is often fol- 
lowed by treatment termed adjuvant therapy. 
This limits the risk of relapse by killing cancer 
cells remaining at the surgical site or those that 
have already moved elsewhere. However, adju- 
vant therapy is not always effective. Moreover, 
it might not prevent certain processes that aid 
cancer resurgence, such as the recruitment 
of blood cells called myeloid cells to distant 
organs, where they can lay the foundation 
for cancer cells to settle and thrive’. Lu et al.” 
reveal on page 284 how the chemical structure 
of the DNA in the nucleus of myeloid cells is a 
vulnerability that can be harnessed to target 
these tumour-promoting cells and limit cancer 
spread. 

Tumour spread from its primary site to 
distant organs, which is called metastasis, 
involves complex interactions between cancer 
cells and the surrounding healthy tissues. Evi- 
dence is growing that primary tumours can 
produce signals that modify normal cells to 
generate a ‘soil’ in distant organs — termed a 
pre-metastatic niche — that permits subse- 
quent ‘seeding’ and establishment of cancer 
cells at this secondary site®. Such secondary 
tumours, or metastases, are often lethal. 

Several types of cancer metastasize from 
their primary site to the lung. Efficient meta- 
stasis of breast cancer cells to the lung in 
mice requires the participation of a type of 
myeloid cell called a monocyte". These cells 
normally function to fight infections, but they 
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can also supply metastasizing cancer cells with 
factors that help them to get established and 
growat asecondary site’*. Certain molecules 
produced by the primary tumour alter the 
properties of monocytes and increase their 
numbers in the bloodstream, fostering the 
monocytes’ tumour-supporting functions in 
the pre-metastatic niche'*. Therefore, block- 
ing such ‘tumour-educated’ monocytes might 
inhibit metastasis. Previous work" indicates 
that neutralizing a protein called CCL2, which 
is produced by the tumour and promotes 
monocyte accumulation in the lungs of mice, 
impairs metastasis in mouse tumour models. 
However, this approach was unsuccessful in 
clinical trials owing to difficulties in effectively 
neutralizing CCL2 (ref. 4). 

Lu et al. studied mice that were given 
transplants under the skin of atype of tumour 
that metastasizes to the lung. Consistent 
with previous studies*”, the authors found 
that two types of myeloid cell — monocytes 
and neutrophils — accumulated in the lung 
before metastases were detectable there. 
These tumour-elicited myeloid cells are col- 
lectively called myeloid-derived suppressor 
cells (MDSCs) owing to their ability to suppress 
the immune response against a tumour’®. Con- 
firming the metastasis-promoting capacity of 
MDSCs, the authors report that elimination 
of MDSCs delayed the metastasis of cancer 
cells to the lung and extended the animals’ 
survival. 

Gene expression can be regulated by 
changes in the nucleus termed epigenetic 


modifications. These include processes that 
add or remove either methyl groups on DNAor 
acetyl groups on DNA-binding proteins called 
histones. Such epigenetic modifications often 
alter the expression of many genes in cancer 
cells, which might lead to loss of expression of 
genes that function as tumour suppressors, 
or to high expression of tumour-promoting 
genes. Anticancer treatments that use drugs 
to modulate epigenetic changes (termed epi- 
genetic therapies) are gaining momentum®. 
Some of these drugs have been tested in clin- 
ical trials and approved for the treatment of 
certain blood tumours’. 

Epigenetic therapy is usually aimed at cancer 
cells, but earlier work’* indicates that such 
treatment can also target tumour-promoting 
myeloid cells. Luand colleagues investigated 
the effect of using low doses of two such drugs: 
5-azacytidine (which inhibits the addition of 
methyl groups to DNA) and entinostat (which 
inhibits the removal of acetyl groups from 
histones). This treatment did not kill cancer 
cells in vitro, nor did it inhibit the growth of 
primary tumours in mice. However, when mice 
that had undergone tumour-removal surgery 
were treated with the drugs, the migration of 
myeloid cells to the lungs and the formation of 
lung tumours were reduced (Fig. 1) compared 
with the effects in mice that did not receive the 
drugs. Lu and colleagues report that a simi- 
lar treatment is well tolerated by people who 
have lung cancer, and preliminary evidence 
presented by the authors also suggests that it 
might limit tumour recurrence after surgery. 

Myeloid cells are made in the bone 
marrow and released into the bloodstream. 
Lu and colleagues report that epigenetic 
therapy modulated the gene expression of 
myeloid cells in the bone marrow of mice. 
By altering the chemical structure of DNA, 
epigenetic drugs can modify the ability of 
transcription-factor proteins to bind DNA 
and promote the expression of certain genes. 

Epigenetic therapy suppressed the activity 
of a transcription factor called NF-KB in the 
marrow-resident myeloid cells. NF-KB controls 
the expression of CCR2 (which is the receptor 
for CCL2) and CXCR2 (whichis the receptor for 
another tumour-derived protein called CXCL1). 
After treatment, monocytes expressed lower 
levels of CCR2, and neutrophils expressed 
lower levels of CXCR2. CCL2 and CXCL1 drive 
the recruitment and retention of myeloid cells 
inthe lung**. Therefore, drug-induced inhibi- 
tion of CCR2 and CXCR2 expression probably 
impaired the process that enables a pre-met- 
astatic niche in the lung to be populated by 
tumour-promoting myeloid cells. 

Lu etal. observed that the drug-mediated 
epigenetic reprogramming of myeloid cells 
was maintained during their exit from the 
bone marrow and when they reached the 
lung. Indeed, in addition to NF-kB, the activ- 
ity of several transcription factors known to 
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Figure 1| Drug therapy that blocks cancer spread in mice. Even if a tumour is surgically removed, remaining 
cancer cells might spread to a distant organ such as the lung. a, Non-cancerous cells called monocytes and 
neutrophils, which express the proteins CCR2 and CXCR2, respectively, arise in the bone marrow. Their 
numbers increase in the bloodstream if the cells are mobilized by tumour-derived factors (factors not shown). 
The cells can travel through blood vessels to reach the lungs and forma site called a pre-metastatic niche. This 
site aids the establishment of cancer cells there (called a metastasis). b, After animals had undergone tumour- 
removal surgery, Lu etal.’ treated them with epigenetic therapy, which consisted of two drugs that target the 
chemical structure of DNA. This treatment decreased the expression of CCR2 in monocytes and CXCR2 in 
neutrophils, and fewer of these cells reached the lung. The epigenetic therapy also caused some monocytes 

to differentiate into a cell called an interstitial macrophage. The impaired recruitment of tumour-promoting 
monocytes and neutrophils to the lungs was accompanied by a decrease in the number and size of metastases. 


orchestrate the generation of a pre-metastatic 
niche’ was suppressed in the myeloid cells that 
made their way to the lung. Intriguingly, the 
authors found that the epigenetic therapy 
caused monocytes to differentiate into a type 
of cell called an interstitial macrophage, which 
might lack the ability to promote tumour 
establishment at a secondary site. The drugs 
also caused this type of cellular conversionin 
monocytes that were experimentally trans- 
ferred from the bone marrow of untreated 
mice to the bloodstream of drug-treated mice. 
How the drugs drive this change is unknown. 
Perhaps their ability to rewire a cell’s transcrip- 
tional network underlies this phenomenon. 

Luand colleagues’ findings indicate that the 
re-education of tumour-hijacked myeloid cells 
using epigenetic therapy might bea promising 
strategy for thwarting their metastasis-pro- 
moting capacity. These cells have several 
tumour-promoting functions, such as boost- 
ing the formation of blood vessels that aid 
tumour growth or suppressing antitumoral 
T cells of the immune system>”. Thus, rather 
than having to individually target factors that 
aid different processes needed for tumour 
growth, epigenetic therapy might bea way to 
neutralize the broad and multifaceted capacity 
of myeloid cells to aid cancer. 

Drugs such as S-azacytidine and entinos- 
tat are being tested in the clinic for use in 
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combination with other anticancer therapies, 
both before tumour-removal surgery and in 
the setting of adjuvant therapy after surgery””. 
These drugs are not approved yet for treating 
solid (non-blood-cell) tumours in the clinic. 
High doses of such drugs are associated with 
severe toxicity, so using low doses that suffice 
to reprogram myeloid cells, together with 
other drugs that target cancer cells, might 
provide an effective anticancer treatment that 
avoids unwanted toxicity. 
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DNA-repair enzyme 
turns totranslation 


Alan J. Warren 


A key DNA-repair enzyme has a surprising role during 
the early steps in the assembly of ribosomes — the 
molecular machines that translate the genetic code 


into protein. See p.291 


Every minute, each human cell constructs 
up to 7,500 ribosomes — essential intracel- 
lular factories that decode instructions from 
genes to make all the proteins in the body. 
Ribosomes are assembled from four distinct 
ribosomal RNA (rRNA) molecules and 80 dif- 
ferent proteins, which form small and large 
subunits, inacomplex process involving more 
than 200 assembly factors. A better under- 
standing of the underlying mechanisms might 
help to explain the devastating consequences 
of genetic mutations known as ribosomopa- 
thies that affect this assembly pathway. On 
page 291, Shao et al.' identify an unexpected 
role for the enzyme DNA-dependent protein 
kinase (DNA-PK) — acore component of the 
machinery for repairing DNA double-strand 
breaks (DSBs) — in the early steps of ribosome 
assembly. 

Cells must repair DSBs promptly, because 
they threaten genomic stability and can lead 
to cell death or cancer. Non-homologous 
end joining (NHE)) is a main pathway for DSB 
repair. A dimeric protein complex called KU 
initiates this process by binding to the broken 
DNA ends, then recruiting the DNA-PK cata- 
lytic subunit (DNA-PKcs) to form the active 
DNA-PK enzyme (Fig. 1a). DNA-PK, through 
its kinase activity, adds phosphate groups 
to the side chains of serine and threonine 
amino acids in other proteins, and heavily 
regulates itself by phosphorylating a cluster 
of amino acids near its serine 2056 (S2056) and 
threonine 2609 (T2609) residues. This activity 
leadstotherecruitmentofotherenzymes,suchas 
Artemis, that process and join the broken 
DNA strands. 

In acomprehensive series of genetic experi- 
ments, Shao and colleagues established that 
both the kinase activity of DNA-PKcs and 
phosphorylation at its T2609 cluster are 
crucial for blood development (haemato- 
poiesis) in mice. Mice that entirely lacked 
both DNA-PKcs and the tumour-suppressor 
protein p53 developed a type of blood can- 
cer and died. By contrast, animals that did 
not have p53 and carried a mutant form of 
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DNA-PKcs lacking kinase activity survived. 
However, they developed a disease of the bone 
marrow reminiscent of a blood cancer called 
myelodysplastic syndrome. Moreover, mice 
in which amino-acid residues in the T2609 
cluster were replaced by alanine residues 
(which could not be phosphorylated) died at 
four weeks old and had severe p53-dependent 
anaemia associated with reduced protein syn- 
thesis. This condition was reminiscent of the 
ribosomopathy Diamond-Blackfan anaemia 
(DBA), whichis caused by mutationsin any one 
of 18 different ribosomal proteins’. 

Shao etal. showed that deletion of the KU 
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protein completely restored haematopoiesis 
in mice that had mutations in the T2609 clus- 
ter, ruling out defective DNA repair alone as 
the explanation for the blood disorders. What, 
then, might DNA-PK be doing in this context? 

The first precursor of the small riboso- 
mal subunit, known as the small-subunit 
processome, is assembled around an RNA 
called U3 (ref. 3; Fig. 1b) in a subcellular com- 
partment called the nucleolus. Shao and col- 
leagues confirmed previous reports*> that 
a proportion of KU and DNA-PKcs resides in 
the nucleolus. These observations suggested 
a link between KU, DNA-PKcs and ribosome 
assembly. The authors provided evidence 
that supports this link by using U3 as ‘bait’ 
to identify components of the small-subunit 
processome, which included DNA-PKcs and 
KU, but not other NHE) factors. 

The small ribosomal subunit is partly com- 
prised of anrRNA called 18S. The researchers 
found that unprocessed precursors of 18S 
rRNA accumulated in cells that lacked DNA- 
PKcs kinase activity, but did not accumulate 
when KU was deleted, too. Moreover, mice 
and cell lines lacking DNA-PKcs kinase activ- 
ity showed reduced global protein synthesis. 
The authors used a technique called infrared 
crosslinking immunoprecipitation (irCLIP) 
to track down DNA-PKcs and KU to a specific 
location of the processome, near U3. Finally, 
they found that a structured fragment of U3 
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Figure 1| Two roles for DNA-dependent protein kinase. a, To repair DNA double-strand breaks, the 
DNA-dependent protein kinase catalytic subunit (DNA-PKcs) is recruited to DNA ends by the KU protein 
dimer. DNA-PKcs phosphorylates itself (P) on an amino-acid cluster near its threonine 2609 (T2609) residue. 
This enables the DNA-cleaving enzyme Artemis to access broken DNA ends, which are processed and joined. 
b, Shao et al.’ have found another role for DNA-PKcs: in the synthesis of the cell’s protein-producing factory, 
the ribosome. Precursor ribosomal RNA (pre-rRNA), which contains a region dubbed 18S, forms part of the 
ribosomal small-subunit processome. The authors find that KU recruits DNA-PKcs to another RNA molecule 
in the processome, U3. Self-phosphorylation might trigger an RNA-dependent conformational change 

in DNA-PKcs, regulating access of an RNA-cleaving enzyme such as UTP24, which cleaves the pre-rRNA to 
produce mature 18S rRNA that forms part of the ribosome. 
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drives the assembly of DNA-PK and stimulates 
its catalytic activity in vitro, although does so 
much less efficiently than can DNA. 

Taken together, these observations suggest 
a modelin which KU recruits DNA-PKcs to the 
small-subunit processome. In the case of 
kinase-defective DNA-PK, the mutant enzyme’s 
inability to regulate its own activity gives the 
protein anew function, blocking the process- 
ing of precursor rRNA into mature 18S rRNA 
in the small-subunit processome. The result- 
ing defect in global protein synthesis drives a 
p53-dependent loss of red-blood-cell precur- 
sors — acell type that has an especially high 
physiological demand for protein synthesis. 
The parallels with NHEJ are intriguing: in that 
pathway, the complete deletion of DNA-PKcs 
results in only a minor reduction in repair 
fidelity, and the joining of broken DNA ends 
is retained. By contrast, the kinase-inactive 
DNA-PKcs mutant is wholly unable to carry 
out end joining. 

The specific role of DNA-PK in precursor 
rRNA processing, and how it recognizes pre- 
cursor rRNA in vivo, remains unclear. However, 
structural analysis of the yeast small-subunit 
processome¢’ has revealed that U3 acts as a 
molecular guide that docks the processome 
onto the precursor rRNA by forming four 
evolutionarily conserved duplexes (hinges) 
between the two components: two hinges 
ina highly branched region of the precursor 
rRNA, and two ina region that will become the 
mature 18S rRNA. These hinges are a prerequi- 
site for three cleavage events, mediated by an 
RNA-cleaving nuclease enzyme, that release 
the 18S rRNA ready to make the small subunit. 

Shao etal. show that DNA-PK and KU primar- 
ily interact with U3 at this hinge region. Thus, 
muchas DNA-PKcs recruits the DNA-cleaving 
enzyme Artemis during the NHE] processing 
of DNA ends’, with U3, DNA-PKcs might also 
help to recruit specific RNA-cleaving nucleases 
(such as UTP24) to the small-subunit pro- 
cessome to cleave the precursor rRNA for 
ribosome construction. 

Structural studies suggest that the binding 
of DNA-PKcs to KUand DNA could regulate the 
activation of DNA-PKcs kinase activity alloster- 
ically, that is, by changing the conformation of 
the enzyme®”. In the future, it will be inter- 
esting to compare RNA- and DNA-dependent 
conformational changes in DNA-PKcs. The 
physiological relevance of the broad array of 
RNA partners identified by Shao etal. in their 
irCLIP analysis also remains to be dissected. 

Shao and colleagues’ study has identified 
an interesting player in ribosome assembly 
that might efficiently couple DNA DSB repair 
with processing of precursor rRNA, which is 
highly transcribed from the naturally unstable 
ribosomal DNA template. Broadly, the find- 
ings encourage us to critically evaluate how 
dynamic redistribution of DNA-PK might 
allow the cell to couple DSB repair with the 


regulation of protein synthesis. And, although 
further studies are required, we might have 
taken a step closer to deciphering the 
mysterious ribosomopathies. 
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Tiny fossil sheds light on 
miniaturization of birds 


Roger B. J. Benson 


A tiny skull trapped in 99-million-year-old amber suggests that 
some of the earliest birds evolved to become miniature. The 
fossil illustrates how ancient amber can act as a window into 


the distant past. See p.245 


Dinosaurs were big, whereas birds — which 
evolved from dinosaurs — are small. This var- 
iation is of great importance, because body 
size affects lifespan, food requirements, sen- 
sory capabilities and many other fundamental 
aspects of biology. The smallest dinosaurs! 
weighed hundreds of grams, but the smallest 
living bird, the bee hummingbird (Mellisuga 
helenae)’, weighs only 2 grams. How did this 
difference come about, and why? On page 245, 
Xing et al? describe the tiny, fossilized, bird- 
like skull of a previously unknown species, 
which they name Oculudentavis khaungraae. 


The discovery suggests that miniature body 
sizes in birds evolved earlier than previously 
recognized, and might provide insights into 
the evolutionary process of miniaturization. 

Fossilization of bones in sediments such 
as clay, silt and sand can crush and destroy 
the remains of small animals, and can flatten 
and decay soft parts suchas skin, scales and 
feathers. By contrast, preservation of small 
animals in Burmese amber (which formed 
from the resin flows of coniferous trees about 
99 million years ago) helps to protect their soft 
parts. A wide range of invertebrates‘ and small 


Figure 1| Computed tomography scan of the skull of Oculudentavis khaungraae. Xing et al.’ have 
characterized this 99-million-year-old fossil bird. 
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drives the assembly of DNA-PK and stimulates 
its catalytic activity in vitro, although does so 
much less efficiently than can DNA. 

Taken together, these observations suggest 
a modelin which KU recruits DNA-PKcs to the 
small-subunit processome. In the case of 
kinase-defective DNA-PK, the mutant enzyme’s 
inability to regulate its own activity gives the 
protein anew function, blocking the process- 
ing of precursor rRNA into mature 18S rRNA 
in the small-subunit processome. The result- 
ing defect in global protein synthesis drives a 
p53-dependent loss of red-blood-cell precur- 
sors — acell type that has an especially high 
physiological demand for protein synthesis. 
The parallels with NHEJ are intriguing: in that 
pathway, the complete deletion of DNA-PKcs 
results in only a minor reduction in repair 
fidelity, and the joining of broken DNA ends 
is retained. By contrast, the kinase-inactive 
DNA-PKcs mutant is wholly unable to carry 
out end joining. 

The specific role of DNA-PK in precursor 
rRNA processing, and how it recognizes pre- 
cursor rRNA in vivo, remains unclear. However, 
structural analysis of the yeast small-subunit 
processome¢’ has revealed that U3 acts as a 
molecular guide that docks the processome 
onto the precursor rRNA by forming four 
evolutionarily conserved duplexes (hinges) 
between the two components: two hinges 
ina highly branched region of the precursor 
rRNA, and two ina region that will become the 
mature 18S rRNA. These hinges are a prerequi- 
site for three cleavage events, mediated by an 
RNA-cleaving nuclease enzyme, that release 
the 18S rRNA ready to make the small subunit. 

Shao etal. show that DNA-PK and KU primar- 
ily interact with U3 at this hinge region. Thus, 
muchas DNA-PKcs recruits the DNA-cleaving 
enzyme Artemis during the NHE] processing 
of DNA ends’, with U3, DNA-PKcs might also 
help to recruit specific RNA-cleaving nucleases 
(such as UTP24) to the small-subunit pro- 
cessome to cleave the precursor rRNA for 
ribosome construction. 

Structural studies suggest that the binding 
of DNA-PKcs to KUand DNA could regulate the 
activation of DNA-PKcs kinase activity alloster- 
ically, that is, by changing the conformation of 
the enzyme®”. In the future, it will be inter- 
esting to compare RNA- and DNA-dependent 
conformational changes in DNA-PKcs. The 
physiological relevance of the broad array of 
RNA partners identified by Shao etal. in their 
irCLIP analysis also remains to be dissected. 

Shao and colleagues’ study has identified 
an interesting player in ribosome assembly 
that might efficiently couple DNA DSB repair 
with processing of precursor rRNA, which is 
highly transcribed from the naturally unstable 
ribosomal DNA template. Broadly, the find- 
ings encourage us to critically evaluate how 
dynamic redistribution of DNA-PK might 
allow the cell to couple DSB repair with the 


regulation of protein synthesis. And, although 
further studies are required, we might have 
taken a step closer to deciphering the 
mysterious ribosomopathies. 
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Tiny fossil sheds light on 
miniaturization of birds 


Roger B. J. Benson 


A tiny skull trapped in 99-million-year-old amber suggests that 
some of the earliest birds evolved to become miniature. The 
fossil illustrates how ancient amber can act as a window into 


the distant past. See p.245 


Dinosaurs were big, whereas birds — which 
evolved from dinosaurs — are small. This var- 
iation is of great importance, because body 
size affects lifespan, food requirements, sen- 
sory capabilities and many other fundamental 
aspects of biology. The smallest dinosaurs! 
weighed hundreds of grams, but the smallest 
living bird, the bee hummingbird (Mellisuga 
helenae)’, weighs only 2 grams. How did this 
difference come about, and why? On page 245, 
Xing et al? describe the tiny, fossilized, bird- 
like skull of a previously unknown species, 
which they name Oculudentavis khaungraae. 


The discovery suggests that miniature body 
sizes in birds evolved earlier than previously 
recognized, and might provide insights into 
the evolutionary process of miniaturization. 

Fossilization of bones in sediments such 
as clay, silt and sand can crush and destroy 
the remains of small animals, and can flatten 
and decay soft parts suchas skin, scales and 
feathers. By contrast, preservation of small 
animals in Burmese amber (which formed 
from the resin flows of coniferous trees about 
99 million years ago) helps to protect their soft 
parts. A wide range of invertebrates‘ and small 


Figure 1| Computed tomography scan of the skull of Oculudentavis khaungraae. Xing et al.’ have 
characterized this 99-million-year-old fossil bird. 
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Figure 2 | Different size ranges of dinosaurs and birds. Dinosaurs varied from about 500 grams to many 
tonnes in weight. By contrast, the first birds were much smaller. The smallest fossil bird found so far from 
the Cretaceous period weighs in at about 12 grams (data taken from ref. 9). Xing et al.3 report that the tiny 
Oculudentavis weighed just 2 grams. This discovery provides new insight into the lower limits of vertebrate 


body size in the age of dinosaurs. 


vertebrates, including lizards° and birds®, have 
been found in Burmese amber. Specimens 
preserved inthis material are rapidly emerging 
as an exceptional way to study tiny vertebrates 
from the age of dinosaurs**. 

It is in Burmese amber that the single 
known fossil skull of Oculudentavis has been 
preserved (see Fig. 1a of the paper?). Oculuden- 
tavis means eye tooth bird, aname that Xing 
et al. chose because of two unusual features 
of the skull, each of which provides evidence 
about the likely lifestyle of this 99-million- 
year-old species. 

First, the skull is dominated by two 
enormous eye sockets containing scleral 
ossicles — rings of bone that form the eye 
skeletons of birds (Fig. 1). The opening at the 
centre of these ossicles is narrow, restricting 
access for light into the eye and providing 
strong evidence that Oculudentavis was active 
in well-lit, daytime environments. 

Second, the jaws of Oculudentavis have 
many smallteeth. This might seem odd, given 
the absence of teeth in today’s birds, but teeth 
arein fact common among early fossil birds’. 
However, Oculudentavis has more teeth than 
other birds of the period, and these extend 
unusually far back in the jaws to a point just 
under the eye. On the basis of these facts, along 
with observations of the fossilized tongue, the 
authors suggest that Oculudentavis was a pred- 
ator that mainly ate invertebrates. This diet 
differs considerably from the nectar-based 
diet of the smallest living birds, and suggests 
that extinct and living birds took different 
paths to miniaturization (although how diet 
might be involved in this process remains 
unknown). 

Oculudentavis is just one fossil species. 
However, even single fossils can contribute 
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greatly to our understanding of the history 
of life on Earth. In this case, weighing perhaps 
2 grams, Oculudentavis is about one-sixth of 
the size of the smallest known early fossil bird’. 
This indicates that, only shortly after their ori- 
gins late in the Jurassic period (which lasted 
from about 201 million to 145 million years 
ago), birds had already attained their min- 
imum body sizes. By contrast, the smallest 
dinosaurs weighed hundreds of times more! 
(Fig. 2). Understanding when, how and why 
the lower limits of body size shifted in this way 
requires greater knowledge of the earliest fos- 
sil birds. But Oculudentavis is a stepping stone 
towards this. 

The evolutionary relationships between 


“Even single fossilscan 
contribute greatly to our 
understanding of the history 
oflifeonEarth.” 


Oculudentavis and other dinosaurs and birds 
are difficult to determine, but are central to 
clarifying the evolutionary implications of this 
discovery. Xing and colleagues’ analysis sug- 
gests two possibilities. Oculudentavis could 
belong to the most common group of birds 
of the Cretaceous period (about 145 million 
to 66 million years ago), the enantiornithines. 
Alternatively, it could be much more closely 
related to dinsosaurs, lying almost midway on 
the evolutionary tree between the Cretaceous 
birds and Archaeopteryx, the iconic winged 
dinosaur from the Jurassic. 

This confusion is a result of the bizarre 
features seen in Oculudentavis. These include 
many characteristics that differ from those of 
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other birds, such as more-robust, fused bones, 
and proportionally enlarged sensory organs 
relative to the overall body size. The authors 
suggest that these features could have arisen 
fromthe constraints of evolutionary miniatur- 
ization or from ecological specialization. Both 
of these might have required Oculudentavis 
to have astrengthened skull and proportion- 
ally large eyes to maintain sensory capacity 
at sucha tiny size. In addition, Oculudentavis 
has features that are not seen in dinosaurs or 
birds, but are presentin lizards — these include 
the spoon shape ofits scleral ossicles and the 
fact thatits teeth are attached tothe jaw bone 
by their sides, rather than being implanted in 
sockets. The challenge of determining how 
Oculudentavis is related to other early birds 
and bird-like dinosaurs would be greatly 
assisted by knowing more about its skeleton. 

The past decade has generated much data 
on the dinosaur-bird transition, greatly 
advancing our understanding of this major 
evolutionary event7®. In the past few years, 
Burmese amber has yielded surprising 
insights, including previously unseen feather 
and skeletal structures in other extinct birds®. 
The study of small vertebrates preserved in 
amber, their ecosystems and their evolu- 
tionary relationships with one another is in 
anascent phase. But Oculudentavis suggests 
that the potential for continued discovery 
remains large — especially for animals of 
diminutive sizes. 
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Computer technology 


Anelectrically operated 
magnetic logic gate 


See-Hun Yang 


Bits of a logic gate can be encoded by differently magnetized 
regions. A method has been developed in which the walls 
between these domains are manipulated electrically, rather 
than magnetically, to produce a logic gate. See p.214 


Computer logic based on standard electronic 
components is expected to hit its speed and 
power limits before long. To get around this, 
logic devices that use magnetic elements, 
rather than electronic ones, have been pro- 
posed — but these usually require external 
magnetic fields, which limits their applica- 
tion. A working logic gate that uses magnetic 
elements and is driven completely electrically, 
without the need for an external magnetic 
field, has yet to be demonstrated!. However, 
on page 214, Luo et al.” report that they have 
harnessed the chirality (‘handedness’) ofa sys- 
tem to invert the direction of magnetization of 
domains ina cobalt wire purely by means of an 
electric current. The resulting inverter device 
acts as a ‘NOT’ gate, which they use to build 
up other logic gates such as NAND and NOR. 

First, some background. Magnetic domains 
are small regions of uniform magnetization 
in a material. The narrow boundaries sepa- 
rating different magnetization orientations 
are called domain walls; within the wall, the 
magnetization must gradually twist around. 
Such magnetic domains have long been used 
to encode data bits for storage, such as in hard 
disk drives, in which mechanical motion is 
needed to access the data. 

By contrast, inmagnetic racetrack memory? 
(whichis still under development), no mechan- 
ical movement is needed. Instead, magnetic 
domains within hair-thin metal wires are 
moved by the flow of electric current in the 
wires. The trick to doing this lies in an inter- 
action — the spin-transfer torque — between 
the magnetic moments of the domain walls 
and the spins of the moving electrons. Here, 
spin refers to aquantum property of the elec- 
trons; moment is the magnetic direction and 
strength; and the torque is a twisting inter- 
action that tends to rotate the moments and 
thus move the walls. 

In spin-transfer systems, the torque arises 
from current in the magnetic wire itself. But 
another type of torque, known as spin-orbit 
torque, is produced when the wire is placed 
ona layer of anon-magnetic heavy metal such 


as platinum, and the electric current flows in 
that layer instead (Fig. 1). In such systems, 
two effects add to each other to drive twist- 
ing in the domain walls more efficiently than 
in spin-transfer systems*». The first is that a 
spin currentarises in the non-magnetic heavy- 
metal layer; the second effect, whichis crucial 
to Luo and colleagues’ work, is that an inter- 
action between the two metals forces aspecific 
chirality on the domain walls. 
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Chirality means handedness: like left and 
right hands, objects of opposite chirality 
have mirror symmetry, but cannot be super- 
imposed on each other. Domain walls known 
as Néel-type walls can exhibit chirality when 
the direction of their magnetic moment is 
reversed, because there are two ways in which 
reversal can happen: the moment can undergo 
a right-handed twist or a left-handed one. 
(Moments canalso spin in the plane of domain 
walls known as Blochwalls, but this mechanism 
isnot used in Luo and co-workers’ study.) 

The chirality is produced by an effect called 
the Dzyaloshinskii-Moriya exchange inter- 
action (DMI), whichacts between the magnetic 
and non-magnetic metals. The DMI both estab- 
lishes a specific chirality and favours the gen- 
eration of the correct type of wall — a Néel wall 
—inLuo and colleagues’ set-up. Researchers 
have previously recognized that chiral domain 
walls could be used in logic operations®, but an 
all-electrical working logic has been hampered 
by the lack of a reliable working inverter, the 
key component for logic operations. 

Luo et al. have ingeniously invented an 
inverter that flips the magnetic moments 
associated with incoming bits of data using 
spin-orbit torque. Building on their previous 


Fixed 
domain wall 


Figure 1| How to flip magnetic data bits electrically. Luo and co-workers? have produced a NOT logic gate 
that flips the direction of magnetization of magnetic domains by electrical means. In the authors’ system, 
mobile domains of a cobalt wire that have up or down magnetic moments act as data bits. From the side, 
spheres with arrows represent the direction of magnetization; onthe top surface, circled dots and crosses 
indicate up and down moments, respectively. a, The authors fabricated a fixed wall between up and down 
domains, in which the magnetization direction can rotate ina left- or right-handed direction — aproperty 
called chirality. b, An electric current in the platinum substrate propels another, moving chiral domain wall 
(blue) along the wire. c, When two opposite magnetic moments collide on the left-hand side of the fixed wall, 
the direction of the moment in the fixed region switches. Anew domain forms on the other side of the wall, 
in which the moment is reversed, preserving the preferred chirality of the system. Overall, the bit has been 


flipped. (Adapted from Fig. 1 of the paper.) 
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work on chirally coupled nanomagnets’, the 
authors fabricated asort of artificial, station- 
ary domain wall in a magnetic cobalt wire 
interfaced with non-magnetic platinum. The 
magnetization inthe cobalt is perpendicular to 
the plane of the wire, except in the stationary 
region. There, it is magnetized in the direc- 
tion of the wire’s long axis, like the region in 
the middle of an ordinary domain wall, but 
across a much larger width. This is crucial, 
because it allows smaller coercivity — that is, 
the magnetization here is easier to switch. 

To picture how the inverter works, consider 
an input consisting of a domain wall that has 
left-handed chirality (Fig. 1). This mobile wall 
is rolled along the wire by spin-orbit torque. 
Whenit reaches the fixed artificial boundary, 
two opposite magnetic moments collide, 
producing a region of the wire in which the 
moment changes abruptly. According to the- 
ories of magnetism, such an abrupt change has 
a high energy cost. To lower the energy of the 
system, one of the moments must be switched, 
or anew magnetic domain must be generated. 
In this case, the moment inthe low-coercivity 
fixed wall switches to the same direction as 
that in the incoming wall. 

But achirality effect now comes into play: 
this switch of magnetic moment produces 
a right-handed chirality at the other side of 
the fixed wall that conflicts with the chirality 
preferred by the DMI. To resolve this, anew 
domain wall forms on that side (the system is 
shaped in such a way as to promote this pro- 
cess) and sets off along the wire. The moments 
inthe resulting outgoing bit thus have the pre- 
ferred left-handed chirality, rather than the 
right-handed chirality originally produced at 
the wall. 

By integrating their inverters into junctions, 
Luo et al. designed some simple logic gates 
(NAND and NOR), as well as more-complicated 
ones (such as exclusive-OR). Eachjunction has 
twoinputs, an intrinsic bias towards one mag- 
netic moment and one output. The output is 
determined by thetwo inverted inputs and by 
the bias at the junctions (rather like a ‘majority 
gate’*). So, when the inputs and bias are (0,0) 
and 1, respectively, inverters immediately 
before the junction invert them to (1,1) and 
0 at the junction itself, which consequently 
outputs 1, acting as a NOT gate. But when the 
inputs are either (1,0) or (0,1), the value of 
the bias determines whether the gate behaves 
as a NOR or a NAND. This majority-gate 
behaviour mitigates the need to precisely 
synchronize the two inputs, offering reliable 
logic operations. 

This logic system satisfies key criteria 
known as cascadability and fan-out. Cascad- 
ability means that the output of one gate is 
produced in the correct form and is strong 
enough to drive the input of the next gate. And 
fan-out means that one gate output can be con- 
nected to several gate inputs’. Moreover, the 
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data can be stored inthe absence of an external 
power source, and evade damage by ionizing 
radiation. 

Challenges remain before chips based on 
Luo and co-workers’ system can reach the 
market. The operating current will need to 
be reduced so that it can be accommodated 
by tiny complementary metal oxide semi- 
conductor (CMOS) transistors, which help to 
pick upinputs and outputs for usein chips. In 
theory, current decreases as the size of wires 
and transistors decreases, and so current 
density (the charge per unit time that flows 
through a given cross-section of the wire) 
remains constant with scaling. A reductionin 
current density will be needed toincrease the 
speed and reduce the energy consumption 
of the authors’ system. Domain-wall velocity 
does not scale linearly with current, and sonew 
materials might need to be used toreduce the 
current density!°". 

Another issue is that the input and output 
states of Luo and colleagues’ system have to 
be detected by microscopy, rather than by an 
electrical method. A different read-out system 
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will be needed for practical applications, but 
this could be technically challenging. Aneffect 
known as tunnelling magneto-resistance 
might offer one solution”. The implementa- 
tion of a domain-wall logic chip that uses an 
electrically driven read-out system should 
be the next goal, following on from Luo and 
colleagues’ exciting discovery. 
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Research onthe 


cerebellum yields rewards 


Jennifer L. Raymond 


A brain structure called the cerebellum has mostly been 
associated with learning from errors. The discovery that 

the cerebellum is also involved in reward-driven learning in 
monkeys implies a previously unappreciated role in cognition. 


People and organizations alike use rewards, 
fromsnackstosalarybonusesandfrequent-flyer 
miles, to shape behaviour through a process 
called reinforcement learning. For example, 
if a dog receives a treat for rolling over in 
responsetoa verbal command, the likelihood 
of that behavioural response tothe verbal cue 
willincrease. Writing in Neuron, Sendhilnathan 
and colleagues! describe neuronal signals that 
could support such reward-driven learning. 
Whatis remarkable is where the authors found 
these signals — not in the brain areas that 
have long been implicated in reinforcement 
learning, but in the cerebellum, a brain struc- 
ture historically associated with error-driven, 
rather than reward-driven, learning. 

The cerebellum is best known for its rolein 
motor-skill learning — the process by which 
movements become smooth and accurate 
through practice. Fifty years of research? 
supports the idea that when you practise a 
movement, such as your tennis backhand, 
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the cerebellum uses feedback about errors to 
gradually refine the accuracy of the movement 
by weakening the neuronal connections that 
are responsible for those errors. It has been 
widely assumed that the cerebellum uses a 
similar, error-correcting learning algorithm 
to support cognition’, because the regions of 
the cerebellum that contribute to cognitive 
functions suchas navigation‘ and social behav- 
iour’ have the same basic circuit architecture 
as those that control movement. 

In the past three years, however, there has 
beenaflurry of studies showing reward-related 
neuronal activity in the cerebellum* ”. What 
are reward signals doingin an error-correcting 
part of the brain? Sendhilnathan et al. lever- 
aged the rapid learning abilities of monkeys 
to gain fresh insights into reward-related 
signalling in the cerebellum. 

In each experimental session, the authors 
presented a monkey with two visual cues it 
had never seen before on a computer screen. 
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enough to drive the input of the next gate. And 
fan-out means that one gate output can be con- 
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data can be stored inthe absence of an external 
power source, and evade damage by ionizing 
radiation. 

Challenges remain before chips based on 
Luo and co-workers’ system can reach the 
market. The operating current will need to 
be reduced so that it can be accommodated 
by tiny complementary metal oxide semi- 
conductor (CMOS) transistors, which help to 
pick upinputs and outputs for usein chips. In 
theory, current decreases as the size of wires 
and transistors decreases, and so current 
density (the charge per unit time that flows 
through a given cross-section of the wire) 
remains constant with scaling. A reductionin 
current density will be needed toincrease the 
speed and reduce the energy consumption 
of the authors’ system. Domain-wall velocity 
does not scale linearly with current, and sonew 
materials might need to be used toreduce the 
current density!°". 

Another issue is that the input and output 
states of Luo and colleagues’ system have to 
be detected by microscopy, rather than by an 
electrical method. A different read-out system 
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will be needed for practical applications, but 
this could be technically challenging. Aneffect 
known as tunnelling magneto-resistance 
might offer one solution”. The implementa- 
tion of a domain-wall logic chip that uses an 
electrically driven read-out system should 
be the next goal, following on from Luo and 
colleagues’ exciting discovery. 
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if a dog receives a treat for rolling over in 
responsetoa verbal command, the likelihood 
of that behavioural response tothe verbal cue 
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and colleagues! describe neuronal signals that 
could support such reward-driven learning. 
Whatis remarkable is where the authors found 
these signals — not in the brain areas that 
have long been implicated in reinforcement 
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ture historically associated with error-driven, 
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The cerebellum is best known for its rolein 
motor-skill learning — the process by which 
movements become smooth and accurate 
through practice. Fifty years of research? 
supports the idea that when you practise a 
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the cerebellum uses feedback about errors to 
gradually refine the accuracy of the movement 
by weakening the neuronal connections that 
are responsible for those errors. It has been 
widely assumed that the cerebellum uses a 
similar, error-correcting learning algorithm 
to support cognition’, because the regions of 
the cerebellum that contribute to cognitive 
functions suchas navigation‘ and social behav- 
iour’ have the same basic circuit architecture 
as those that control movement. 

In the past three years, however, there has 
beenaflurry of studies showing reward-related 
neuronal activity in the cerebellum* ”. What 
are reward signals doingin an error-correcting 
part of the brain? Sendhilnathan et al. lever- 
aged the rapid learning abilities of monkeys 
to gain fresh insights into reward-related 
signalling in the cerebellum. 

In each experimental session, the authors 
presented a monkey with two visual cues it 
had never seen before on a computer screen. 


Cue1 
Lift left hand 
to get reward 


Cue 2 
Lift right hand 
to get reward 


Fruit juice 
reward 


Monitor 
neural activity 


Cerebellu m- 


Computer 
screen 


Figure 1| A rewarding choice. a, Sendhilnathan et al.’ examined neural activity in a brain region called the 
cerebellum during reward-driven learning. The authors presented monkeys with two visual cues. For one 
cue, the monkey needed to lift its left hand to receive a reward of fruit juice; for the other, lifting the right 
hand would lead to a reward. b, The monkeys performed a series of trials in which they were presented with 
one of the two cues. The authors monitored cerebellar neuronal activity while the monkeys learnt, through 
trial and error, which response would produce a reward for each cue. They found that a subpopulation 

of neurons carried information about the success or failure of the previous trial until the next trial was 
completed (not shown). (Figure adapted from Fig. S2 of ref. 1.) 


One arbitrarily assigned cue would result in 
the monkey receiving a reward of fruit juice if 
the animal responded by lifting its left hand. 
The other cue would result in a reward if the 
monkey lifted its right hand. The research- 
ers monitored the activity of neurons called 
Purkinje cells inthe cerebellum asthe monkeys 
learnt, through trial and error, to make the 
correct response to each visual cue (Fig. 1). 

Sendhilnathan etal. found that the activity 
of cerebellar Purkinje cells carried information 
about the success or failure of the monkey’s 
most recent attempt at the task. One sub- 
population showed high activity following a 
correct response to the cue; another showed 
high activity following a failed attempt. These 
signals arose afew hundred milliseconds after 
the end of atrial and persisted until the next 
trial was completed. As such, they seemed to 
providea working memory that could enable 
the outcome of one trial to guide the next 
behavioural choice. 

These signals are reminiscent of those 
carried by neurons in frontal and parietal 
regions of the brain’s cerebral cortex, which 
encode the ‘value’ of different behavioural 
choices on the basis of reward history over 
multiple trials’. In the current study, the 
cerebellar neurons kept track of only the 
most recent trial’s outcome. But in this task, 
the outcome of a single trial provides suffi- 
cient information for the monkey to infer 
the correct response for the next trial — ifa 
reward was not given when a monkey lifted 
its right hand in response to one visual cue, 
for instance, then the correct response to that 
cue must be to lift the left hand, and the correct 
response to the other visual cue would be to 
lift the right hand. It would be interesting to 
know whether cerebellar neurons can keep 


track of amore-extended history of rewards 
should the task require it, and whether the 
cerebellum interacts withthe cerebral cortex 
in performing this computation. 
Importantly, information about the previous 
trial’s outcome was present in the cerebellum 
only when anewset of cue-response associa- 
tions was being learnt. As monkeys improved 
their performance over trials, the neuronal 
activity encoding each outcome waned. More- 
over, the signal was not present when monkeys 
earned rewards by responding to a pair of 
visual cues that they had mastered through 
several months of training. These observa- 
tions indicate that cerebellar neurons are not 
simply carrying information about rewards, 


“The cerebellum seemsto 
contribute specifically to 
learning about howto earn 
rewards in anew situation.’ 


predictions about rewards or the movements 
that animals make when anticipating rewards. 
Rather, the cerebellum seems to contribute 
specifically to learning about how to earn 
rewards ina new situation. The authors spec- 
ulate that the cerebellum might enhance the 
rate of learning about rewards, a possibility 
supported by the recent discovery in rodents 
of direct, excitatory projections from the 
cerebellum to neurons in the brain stem that 
release the reward-associated neurochemical 
dopamine". 

There are several intriguing parallels 
between the signals found by Sendhilnathan 
and colleagues and the signals involved in 
cerebellar control of movement. First, as with 
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reward-driven learning, for some motor skills, 
cerebellar Purkinje cells contribute selectively 
to new motor learning and notto performing 
older motor skills*!°. Second, Purkinje-cell 
activity carries information that could guide 
both ongoing behaviour and the induction 
of learning during motor- and reward-based 
learning’. Third, the Purkinje cells carry sig- 
nals that could support working memory in 
the form of activity maintained from one trial 
to the next in reward-based learning, and in 
the form of activity maintained during a delay 
period between a cue andthe motor response 
tothe cue, which seems to support motor plan- 
ning™"®, Finally, during bothtypes of learning, 
individual Purkinje cells are active for aspecific 
time period of a few hundred milliseconds, 
with information seemingly passed from cell 
tocellover time”. These striking parallels raise 
the possibility that the cerebellum performs 
a similar function for error-driven motor 
learning and reward-driven reinforcement 
learning. 

We learn from both our successes and our 
failures. These two learning schemes were 
previously attributed to distinct brain struc- 
tures, but the current results, along with those 
of others® ”, blur these mechanistic and con- 
ceptual boundaries. As such, the work high- 
lights the need to consider how long-range 
interactions between brain areas support the 
shaping of behaviour by experience. 
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Nuclear spins are highly coherent quantum objects. In large ensembles, their control 


and detection via magnetic resonance is widely exploited, for example, inchemistry, 
medicine, materials science and mining. Nuclear spins also featured in early proposals 
for solid-state quantum computers’ and demonstrations of quantum search? and 
factoring? algorithms. Scaling up such concepts requires controlling individual 
nuclei, which can be detected when coupled to an electron**. However, the need to 
address the nuclei via oscillating magnetic fields complicates their integration in 
multi-spin nanoscale devices, because the field cannot be localized or screened. 
Control via electric fields would resolve this problem, but previous methods’ ® relied 
on transducing electric signals into magnetic fields via the electron-nuclear hyperfine 
interaction, which severely affects nuclear coherence. Here we demonstrate the 
coherent quantum control of asingle “Sb (spin-7/2) nucleus using localized electric 
fields produced within a silicon nanoelectronic device. The method exploits an idea 
proposed in1961"° but not previously realized experimentally with a single nucleus. 
Our results are quantitatively supported by a microscopic theoretical model that 
reveals how the purely electrical modulation of the nuclear electric quadrupole 
interaction results in coherent nuclear spin transitions that are uniquely addressable 
owing to lattice strain. The spin dephasing time, 0.1 seconds, is orders of magnitude 
longer than those obtained by methods that require a coupled electron spin to 
achieve electrical driving. These results show that high-spin quadrupolar nuclei could 
be deployed as chaotic models, strain sensors and hybrid spin-mechanical quantum 
systems using all-electrical controls. Integrating electrically controllable nuclei with 


quantum dots” 


could pave the way to scalable, nuclear- and electron-spin-based 


quantum computers in silicon that operate without the need for oscillating magnetic 


fields. 


Nuclear magnetic resonance (NMR) relies onthe presence ofa static 
magnetic field, By, that separates the energy levels of the nuclear 
spins, and aradio-frequency (RF) oscillating magnetic field, B,, that 
induces transitions between such levels. Magnetic fields cannot be 
easily confined or screened at the nanoscale. Therefore, identical 
nuclear spins within large regions would all respond to the same 
signal, preventing the spins from being individually addressed. Elec- 
tric fields, instead, can be efficiently routed and confined within 
highly complex nanoscale devices, with a prime example being the 
sophisticated interconnects found in modern silicon computer chips. 
These observations suggest that an ideal route to scale up nuclear- 
spin-based quantum devices would involve the use of RF electric 
fields for spin control. 


A theoretical idea crucial to this strategy was proposed by Bloember- 
gen as early as 1961”: for nuclei with spin/> 1/2 and non-zero electric 
quadrupole moment gq,, a resonant electric field induces nuclear spin 
transitions by modulating the nuclear quadrupole interaction, if the 
nuclei are placed in solids that lack point-inversion symmetry at the 
lattice site. Inbulk ensembles, the static shift of the NMR frequency by 
ad.c. electric field, named linear quadrupole Stark effect (LOSE), was 
observed inthe 1960s". The resonant version of LQSE, called nuclear 
electric resonance (NER) was demonstrated only recently“ in a bulk 
gallium arsenide (GaAs) crystal. 

We report here the demonstration of NER and coherent electrical 
control of a single antimony (”°Sb) nucleus in silicon (Si). The discovery 
that this nucleus could be electrically controlled was fortuitous. The'?Sb 
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Fig. 1|!°Sb nuclear spin ina silicon device. a, False-colour scanning electron 
micrograph of the silicon metal-oxide-semiconductor device used in the 
experiment. Note the gaps inthe nominally short-circuited antenna 
terminations. S, source; D, drain; SET, single-electron transistor. b, Energy-level 
diagram of the spin-7/2 nucleus of an ionized °Sb donor. The magnetic field By 
introduces a Zeeman splitting (green dashes), and the electric quadrupole 
interaction Q,, causesa further energy shift (blue dashes). c, Nuclear spin 


donor atom has anuclear spin of /=7/2 with electric quadrupole moment 
n= 0.69 b. Depending on its electrochemical potential relative toa 
nearby electron reservoir, an electron (with spin S=1/2) may be boundto 
the nucleus. The atom was implanted ina metal-oxide-semiconductor 
nanostructure (Fig. 1a) fabricated on isotopically enriched *8Si (for fabri- 
cation details, see Supplementary Information section 3), similarto those 
developed for phosphorus (!P) spin qubits*>"*. The structure contains 
asingle-electron transistor (SET) for single-shot electron spin readout, 
which is based on energy-selective electron tunnellingintoa cold charge 
reservoir”. Four electrostatic gates control the electrochemical potential 
of the donor, and a broadband on-chip microwave antenna’ delivers 
coherent control signals tothe donor spins. Single-shot, quantum non- 
demolition nuclear spin readout’ is obtained by combining single-shot 
electron readout with selective excitation at a specific electron spin 
resonance frequency, which depends on the nuclear state because of the 
strong hyperfine interaction (see Methods). The antenna is nominally 
terminated by ashort circuit, in order to obtain maximum current atits 
tip and produce strong oscillating magnetic fields to control both the 
electron (at about 40 GHz) and the nuclear (at about 10 MHz) spins of 
the donor. In this device, however, an electrostatic discharge damaged 
the short-circuit termination (Fig. 1a). Although the small gap in the 
termination had a low enough impedance at 40 GHz, to allow current 
flow for electron spin resonance, at about 10 MHz it produced solely 
anRF electric field. Once we realized that NER was possible, we began 
to use the electric gates fabricated exactly above the donor, which had 
aneven stronger effect. 

We focus here on the “Sb donor in its ionized state; the removal of 
the donor-bound electron precludes any interpretation of the data 
involving modulation of hyperfine fields”’. The electron is introduced 
only for the final readout phase. 

In nanoscale Si devices, the aluminium (Al) gates can cause consid- 
erable lattice strain at low temperatures, owing to the different ther- 
mal contraction of Aland Si (ref. '8). Lattice strain creates an electric 
field gradient (EFG) of V.g= 0°V/dadB (Vis the electric potential and 
a, B € {x, y, z}) at the nuclear site”°”! (Fig. 1b), which produces a static 
nuclear quadrupole interaction Q,,=€q,V_p/[2/(2/- 1)h] (his the Planck 
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f, 


5/2 & 7/2 “2% 


transition frequencies as a function of Q,,. Anon-zero Q,, results in seven 
individually addressable nuclear resonances. The m,=~-1/2 «+ +1/2 transition 
(dashed blue line in c) is forbidden in NER. The measured quadrupole splitting 
f= 66 kHz is indicated bya dashed purple line. d, Shear strain in the silicon 
substrate, calculated ona vertical cross-section under the orange dashed line 
ina. 


constant and eis the electron charge), resulting ina quadrupole split- 
ting fg of the nuclear resonance frequencies (Fig. 1d), making all transi- 
tions individually addressable. 

The application of an RF electric field of amplitude £, modulates the 
nuclear quadrupole energies by 5Q,, and 5Q,,,and induces transitions 
between nuclear states ata rate of f pea  |8Q.,(m, - URL + Lilm,)1, 
where m,is the secondary spin quantum number, ranging from —/to/ 
insteps of 1, and /,, /,, /,are the eight-dimensional operators describing 
thex, y, z projections of the /=7/2 spin. Notably, the transition rate is 
predicted to be zero for the m,=—1/2 © +1/2 transition (see equation 
(15) in Supplementary Information section 2C),a consequence of the 
selection rules of electric quadrupole transitions. Because the quad- 
rupole interaction is quadratic in the nuclear spin operators, first-order 
transitions between spin states that differ by Am, = +2 are allowed. 
These occur at a rate Of a,  |6Q,,.(m, - 2\F2\m))| (see equation 
(19) in Supplementary Information section 2C) and, importantly, all 
Am, = +2 transitions have a non-zero rate. 

Figure 2a shows the experimental NER spectrum for Am,= +1 transi- 
tions, which contains six sharp resonances separated by f, = 66 kHz. 
The m,=~1/2 « +1/2 transition is absent, as expected from NER. All 
six predicted Am, =+2 transitions are observed (Fig. 2b). The ability 
to excite the m,=—1/2 < +3/2 transition was used to ‘jump over’ the 
forbidden m,=-—1/2 © +1/2 transition and observe the Am,= +1 transi- 
tions at negative m, which would otherwise be inaccessible if starting 
froma positive m,. Similarly, the NER spectrum for Am,= +2 transitions 
(Fig. 2b) could be completed only by employing a Am,= +1 transition. 

Figure 2c, d presents the observed transition rates between each 
pair of states, in excellent agreement with the predicted trends from 
NER theory. Using NMR, the Rabi frequencies for the Am,= +1 transi- 
tions would be ‘guacho « |yB,(m, - Ufm,)| (n= 5.55 MHz Tis the 
nuclear gyromagnetic ratio), which is notably maximal for the 
m,=-1/2< +1/2 transition. The Am,=+2 NMR transitions are forbidden 
to first-order. These results prove decisively that our experiments do 
not constitute a form of magnetic resonance. 

As observed in earlier experiments on*P (refs. °”), the nuclear spins 
of ionized donors in 78Si have exceptional quantum coherence 


Fig.2| Nuclear electricresonance. 
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a,b, NERspectrum for the Am,=+1 (a) and 
Am,=+2(b) transitions, obtained by 
applying voltage V£*° toa donor gate (see 
Fig. 1a). The m,=-1/2 «+ +1/2 transition 

(a, red) was not observed, as expectedin 
NER. To acquire the complete Am,=+1 
spectrum, the m,=—1/2  +3/2 transition 
was used to bridge the positive and 
negative m,values. Py, represents the 
probability of flipping the nuclear spin 
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values. All Rabi frequencies closely follow 


|3/2) © |7/2) the NER prediction, including the absence 
T 


of the m,=~-1/2 < +1/2 transition (red 
circle inc, reddots ina), and are 
incompatible with NMR. e, f, Nuclear Rabi 
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properties. We performed a Ramsey experiment (Fig. 2g) on the 
m,= +5/2 < +7/2 (Am,= +1) transition to extract the pure dephasing 
time73,,,(+5/2 © + 7/2) = 92(8) ms (68% confidence level), which cor- 
responds to an NER broadening (full-width at half-maximum) of 
T= |n2/(1T 3,4) = 2.4(2) Hz. The m, = +3/2 @ +7/2 (Am, = +2) transition 
has shorter dephasing time, 73,,,(+3/2 © + 7/2) =28(1) ms (Fig. 2h). 
Both values, although extremely long in absolute terms, are noticeably 
shorter than thetime7,,, = 250-600 ms measured on the*'P nucleus in 
two other similar devices’® fabricated on the same *8Si wafer. Given that 
the*P nucleus has zero quadrupole moment, this suggests that the “Sb 
coherence may beaffected by electrical noise”, ina way that the “P coher- 
enceis not. Nonetheless, our dephasing time remainstwo orders of mag- 
nitude longer than that observed in*!P when adding ahyperfine coupled 
electron, 73,9% 430-570 ps (ref. '°) and three orders of magnitude 
longer than the observedT 5 = 64 psofaterbiumnucleusina single-atom 
magnet’. This observation highlights the benefit of a purely electrical 
control mechanism that does not rely on hyperfine interactions. 

We measured the Rabi frequencies of the Am,=+1and Am,=+2 NER 
transitions as a function of the amplitude of the RF voltage applied to 
the gate, finding transition rates g, , = 34.21(3) Hz mV" (Fig. 3a) and 
2¢2=1.995(4) Hz mV" (Fig. 3b). These transition rates show that NER 
is a weak effect but, owing to the long nuclear spin coherence in “°Si, 
we were able to perform high-fidelity Rabi flops persisting for tens of 
milliseconds (Fig. 2e, f). 

In addition to driving nuclear spin transitions with an RF voltage, we 
were able to apply Stark shifts to the resonance frequencies using an 


confidence level. 


T (ms) 


additional d.c. voltage AV 82 on the gates (Fig. 3c, d). All NER frequen- 
cies shifted according to Af= (A /OVBC") Am lle, ~— (Am,/2)]AVEe" , 
where Of, /@VBe° = 9.9(3) Hz mV *, and |Am|[m, — (Am,/2)] is a factor 
of order unity that represents the matrix element of the electric quad- 
rupole interaction between the initial and final state of each transition 
(see Supplementary Information section 2C for details). 

The results reported here constitute the first, to our knowledge, 
observation of coherent, purely electrical control of a single nuclear 
spin. Achieving this in silicon is, at first sight, remarkable: no effect 
of electric fields on nuclear spins has ever been observedin anon-polar, 
non-piezoelectric materialin the absence ofahyperfine-coupled electron. 
To gain a microscopic understanding of this phenomenon, we conjec- 
tured that our results are a form of LQSE”. Resonant transitions between 
nuclear levels induced by electric fields (NER) require that the crystal does 
not possess point-inversion symmetry at the atomic site”, as is indeed the 
case for silicon. The observation of individual NER transitions, separated 
by thenuclear quadrupole splitting/,, implies thata static EFG must exist 
atthe nuclear site. This requires breaking the 7, (tetrahedral) symmetry 
of thesilicon crystal, as it would otherwise have zero net EFG. For instance, 
uniaxial strain (for example, €,,) lowers the symmetry to D,, (tetrago- 
nal scalenohedral), whereas shear strain (for example, €,,—€,,) lowers it 
to C,, (rhombic pyramidal). The 7, symmetry can also be broken by an 
electric field that polarizes the atomic bonds. This latter effect explains 
both the observation of NER and the static shift of the nuclear spin 
resonance lines (Fig. 3c, d) due to LOSE on application of a static gate 
voltage. 
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Fig. 3| Linear quadrupole Stark effect. 
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The larger and charged donor atom introduces a local lattice distor- 
tion, displacing its four coordinating Si atoms by 0.2 A, and polarizes 
the charge density along the bonds (Fig. 4b, d). This, however, does not 
yet break the 7,symmetry. An EFG is obtained by further introducing 
strain. The S tensor that links EFG to strain has two uniquecomponents, 
Si, (uniaxial) and S,, (shear). We conducted a first-principles, density 
functional theory calculation and extracted S,, =2.4 x 10” Vm~7and 
S44=6.1x 10” V m7? (see Supplementary Information section 7C2 for 
details). Using a finite-element numerical model we computed the 
strain profile in our device, as caused by the different thermal expan- 
sions of Siand Al on cooling to cryogenic temperatures!*”! (Fig. 1b). 
Finally, we triangulated the most likely location of the ’Sb donor by 
combining the implantation depth profile with a model of the relative 
capacitive coupling between the donor and different pairs of control 
gates, extracted fromthe experimental charge stability diagrams (see 
Extended Data Fig. 4 and Supplementary Information section 7C3 for 
details). By combining these three pieces of information, we arrived 
at aspatial map of quadrupole splittings fg (Fig. 4c), which shows good 
agreement between the models and the experiment around the pre- 
dicted location of the donor under study. 

The effect of electric fields on the quadrupole interaction, both 
static (LQSE) and dynamic (NER), can be understood as arising from 
the single unique component of theR tensor, R,, (see Supplementary 
Information section 7D1 for details). By combining a finite-element 
model of the electric field in the device, the estimated *Sb* donor 
position and the experimental values of LQSE and NER Rabi frequen- 
cies, we extracted R,,=1.7 x 10” m™ (see Supplementary Information 
section 7D2 for details). The strength of this coupling is comparable 
to prior bulk measurements of LQSE on arsenic (As) in GaAs (ref. 4). 
This can be understood by observing that, although the Sb*-Si bond 
has a weaker ionic character than the Ga—As bond, R,, scales with atomic 
number, leading toa similar overall value. Given that our model agrees 
with the experiment within a factor of order unity and no alternative 
explanation comes within orders of magnitude of the results (see Sup- 
plementary Information section 7E for details), we conclude that we 
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have observed the manifestation of LQSE and NER ina single nuclear 
spin in silicon. 

Our results have substantial consequences for the development of 
nuclear-spin-based quantum computers and the design of nanoscale 
quantum devices. The Hilbert space of the/=7/2°Sb nucleus has eight 
dimensions. It can encode the equivalent of three quantum bits of 
information, allowing simple quantum algorithms” or quantum error 
correction codes”*, all using solely electric fields. The donor electron 
and nuclear spins combined forma ‘flip-flop’ qubit", controllable by 
electric-dipole spin resonance. This scheme normally requires amag- 
netic antenna to reset the nuclear state in the appropriate qubit sub- 
space. This need could be removed completely by using an electrically 
drivable high-spin nucleus such as Sb. A recent result showed that 
lithographic quantum dots in silicon can be entangled with nuclear 
spins and that the nuclear coherence canbe preserved while shuttling 
the electron between different dots”. Electron spin qubits in silicon can 
be coherently controlled by electric fields with high speed and high 
fidelity?” Adding the ability to electrically control quadrupolar nuclei 
paves the way to quantum computer architectures that integrate fast 
electron spin qubits with long-lived nuclear quantum memories while 
fully exploiting the controllability and scalability of silicon metal- 
oxide-semiconductor devices, without the complication of routing 
RF magnetic fields within the device. 

The experimental validation of a microscopic model of the relation 
between strain and quadrupole splitting, obtained in a functional 
silicon electronic device, suggests the use of quadrupolar nuclei as 
single-atom probes of local strain, which has a key role in enhancing 
the performance of ultra-scaled transistors”®. 

The NER methods and microscopic models presented here could 
be extended to the study of quadrupolar nuclei in materials such as 
diamond andsilicon carbide, where electrical and strain tuning of opti- 
cally addressable electronic spins has been demonstrated”. 

The observation of a large quadrupole splitting of fg =66 kHzina 
high-spin nucleus creates a platform in which to study quantum chaotic 
dynamics ina single particle”. This has further applications in quantum 
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Fig. 4| Microscopic origins of the quadrupole interaction. a, Valence charge 
density near the Sb‘ atom (gold) and its 16 closest Siatoms (black) with a charge 
density isosurface (red). The positive charge of the donor causes an 
asymmetric charge density along the Sb*-Si bond but, inthe absence of strain 
or external electric fields, the EFG at the °Sb site vanishes by symmetry. 

b, Shear strain displaces the Si atoms and covalent bonds neighbouring the 
23Sb nucleus, creating an EFG that results in a quadrupole shift.c, Quadrupole 
splitting fg, predicted by combining density functional theory calculations and 
finite-element simulations (see Supplementary Information section 7C for 
details). Black contours enclose the 68% and 95% confidence regions for the 
location of the donor, as obtained from capacitance triangulation and the 
donor implantation profile (see Supplementary Information section 7A for 
details). d, Electric fields applied via the gate voltage distort the charge 
distribution, resulting in both linear frequency shifts (LQSE) and coherent spin 
transitions (NER). e, Calculation of the NER Rabi frequencies caused by 
electrical EFG modulation (green lines), compared to experimental results fora 
Am,=+1 (dots) and aAm,=+2 (squares) transition. All fg,,; values are determined 
using a single parameter, R,,, calculated via finite-element modelling and 
electronicstructure theory. No free fitting parameters were used. 


information science, for example, because of the remarkable analo- 
gies between chaotic spin models and digital quantum simulations*. 

Although the strain in the present device is static, our work allows 
us to predict the nuclear Rabi frequencies that would arise from time- 
dependent strain (see Supplementary Information section 8 for details). 
A dynamical strain of about 5 x 10°8 would cause a Rabi frequency of 
10 Hz, comparable to both the inhomogeneous nuclear linewidth 
[,,~ 2.4 Hz and to the linewidth /, of high-quality silicon mechanical 
resonators in the megahertz range™. Therefore, it is conceivable that 
thestrong-coupling limit of cavity quantum electrodynamics might be 
achieved between a single nuclear spin and a macroscopic mechanical 
oscillator, adding a novel spin-mechanical coupling pathway to the 


toolbox of hybrid quantum systems for quantum information process- 
ing and precision sensing**. 
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Methods 


Fabrication 

The device was fabricated on a (100) p-type silicon wafer, with a 
900-nm-thick epitaxial layer of isotopically purified 78Si on top (con- 
centration of residual Si, 730 ppm). Metallic leads for the SET were 
formed using optical lithography and phosphorus diffusion. The sub- 
strate was subsequently covered with a 200-nm-thick field oxide, with 
a small central window (10 x 20 ppm”) containing a high-quality, ther- 
mally grown layer of SiO, witha thickness of 8nm. Using a combination 
of standard optical and electron-beam lithography techniques, the 
device was fabricated on this thin oxide window. First, a small 
(90 x 100 nm?) window was defined, through which Sb ions were 
implanted at an energy of 8 keV and a fluence of 2 x 10" cm”, corre- 
sponding to an average of 14 donors in the implantation window. 
Donors were activated using a rapid thermal anneal at 1,000 °C for 5s. 
Next, in two electron-beam lithography steps, the gates forming the 
SET, the donor gates and the microwave antenna were created using 
thermally evaporated aluminium and lift-off, with native aluminium 
oxide as the gate dielectric. Ohmic contacts to the n-doped SET leads 
were formed using optical lithography, evaporated Al and lift-off, fol- 
lowed by a forming gas anneal. A detailed step-by-step process flowis 
given in Supplementary Information section 3. 


Experimental setup 

Thesample was cooled to a temperature of 20 mK in a dilution refrig- 
erator (Bluefors BF-LD400) fitted with a superconducting magnet. 
During the measurements, arbitrary waveform generators (Signadyne 
M3201A and M3300A) were used to tune the donor electrochemical 
potential, generate NER pulses and IQ-modulate the microwave sig- 
nals generated by a vector microwave source (Keysight E8267D). The 
SET current was amplified with a transimpedance amplifier (FEMTO 
DLPCA-200 in combination with Stanford Instruments SIM911) and 
subsequently measured with a digitizer (Signadyne M3300A). Full 
details of the experimental setup, including a wiring schematic, can 
be found in Supplementary Information section 4. 


Nuclear spin readout 

The nuclear spin state is measured via electron spin readout. For 
nuclear spin readout, an electron is introduced to the donor by tun- 
ing its electrochemical potential about the Fermi level of the SET such 
that a spin-down electron tunnels onto the donor. The electron spin 
resonance (ESR) spectrum (Extended Data Fig. 1) shows eight distinct 
resonance lines, each corresponding toa single nuclear spin eigenstate. 
The electron spincan be flipped conditionally on the nuclear spin state, 
resulting in single-shot nuclear spin readout. Electron spin readout is 
achieved by spin-to-charge conversion through spin-dependent tun- 
nelling onto anSET and subsequent detection of the change in charge 
occupation of the donor (see Supplementary Information section 5 
for details). As each of these electron spin measurements project the 
nucleus into a single spin eigenstate, this is aquantum non-demolition 
measurement. Therefore, each single-shot nuclear spin readout can 
be repeated to increase the nuclear spin readout fidelity while retain- 
ing the single-shot nature of the nuclear spin measurement. An NER 
pulse has a probability P,,, of flipping the nuclear spin between two 
states. To measure P,,,, an NER pulse followed by nuclear spin readout 
is performed Nirerations times. The first record of the nuclear spin state 
is used as areference, and each subsequent record is compared tothe 
one before it. This reveals the number of times that the nucleus flips, 
Naips between the two spin states. Therefore, the flip probability is 
simply the number of flips per number of recorded attempts, that is, 
Paip = Noips/(Niterations -1). 


Theoretical modelling 
The spin Hamiltonian of the Sb nucleus takes the form: 


= V,Bol, Bs 


>| => 


> Quplals (1) 


a, Betx,y,z} 


where h= 6.626 x10) Hz ‘is the Planck constant, y, =—-5.553 MHz T* 
is the nuclear gyromagnetic ratio and B, = 1.496 T. In the presence of 
an RF electric field of amplitude F,, the Am,= +1 transitions are driven 
by an additional Hamiltonian term of the form: 


~ NER a 


Am,-tom, (O/A = cos(2nft)[6Q, (Ad + Lh) +6Q,,Gh+hi)1 (2) 


The Am,= +2 transitions are driven by a term of the form: 


~ NER 


Ay, 26m, (O/A = cos(2mft)[6Q,,f; + 5Q,/,+60,,h/,+2h)1  @) 
A detailed derivation of the matrix elements responsible for driving 
the Am,=+1land Am,=+2 NER transitions is given in Supplementary 
Information section 2C. A finite-element model is used to compute 
the strain and electric fields in the silicon layer near the donor posi- 
tion using the COMSOL multiphysics software. The donor position is 
triangulated by comparing simulated gate-to-donor coupling strengths 
with the experimentally observed strength, combined with the donor 
implantation profile (see Extended Data Fig. 4 and Supplementary 
Information section 7A). Kohn-Sham density functional theory is 
employed to calculate the components of the S tensor that describe 
the impact of strain on the EFG. To this end, 64- and 512-atom super- 
cells were strained using the PAW (projector augmented-wave) for- 
malism® with a plane-wave basis, as implemented in VASP (Vienna ab 
initio simulation package)****. The electric-field response tensor is 
estimated by comparing the data points from the d.c. LOSE (Fig. 3c, 
d) and Am,= +1 (Fig. 2c) and Am,= +2 (Fig. 2d) Rabi frequencies with 
the simulated electric fields at the triangulated donor position. The 
final R,, is found by minimizing the normalized residuals of the three 
separate R,, estimates. Full theoretical modelling details can be found 
in Supplementary Information section 7. 


Data availability 


All data necessary to evaluate the claims of this paper are provided in 
the main manuscript and Supplementary Information. Raw data files, 
data analysis code and simulation code are available at https://doi. 
org/10.26190/5de9c295a8821. 
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Extended Data Fig. 1| ESR spectrum ina magnetic field of By =1.496T. a, ESR 
frequencies for eight nuclear states. The average difference between 
successive ESR transition frequencies (black line) gives a hyperfine interaction 
of A=96.5 MHz, substantially lower than the bulk value of 101.52 MHz. One 


possible cause for this deviationis strain, which is known to modify the 
hyperfine interaction”. b, ESR spectral lines. For each nuclear state, the 
nucleus was initialized at the start of each microwave sweep, and adiabatic ESR 
pulses with 1 MHz frequency deviation were applied to excite the electron. 
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Extended Data Fig. 2| NER Rabioscillations onresonance. a, b, Nuclear Rabi 
oscillations measured with varying NER pulse duration, typ, While the pulse 
amplitude was fixed at V%),,=20 mV for Am,=+1 transitions (a) andVif..=40 mV 
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for Am,=+2 transitions (b). Black lines are non-decaying sinusoidal fits to the 


data, and error bars show the 68% confidence level. 
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Extended Data Fig. 3 | NERspectral line shifts for varying d.c. gate voltage. 
a,b, The spectral lines ofall Am,=+1 transitions (a) and Am,=+2 (b) transitions 
are measured while the d.c. gate voltage bias AV¢°is varied (columns) during 
the NER pulse. Wenote that this change in V8¢* is applied on top of large gate 
voltages, of the order of 0.5 V, which are necessary to electrostatically tune the 
device to enable its operation. The varying AV8¢° modifies the quadrupole 
interaction via the LQSE (see Supplementary Information section 7 for details), 
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resultingin shifts of the resonance peaks. A single fit to the resonance 
frequency shifts of all Am,=+1and Am,=+2 transitions (solid lines) gives an 
estimate of the gate-dependent quadrupole shift of Af, /avét* =9,9(3) HzmV7. 
From the top to the bottomtransition, the drive strengths V8 are 20 mV, 

20 mV, 25 mV, 25 mV, 20 mV and 25 mV for Am,= +1 (a) and 30 mV, 30 mV, 40 mV, 
40 mV,40 mVand 40 mV for Am,=+2 (b). Error bars show the 68% confidence 
level. 


a 
P(x,z) (1/nm?) 
40 - . 
0.025 
20 | 0.02 
= DFR 0.015 
E © Siege ; 
x< 
0.01 
-20 4 
0.005 
-40 Lb : ; ; DBR_J 0 
-40 -20 0 20 40 


z (nm) 


Extended Data Fig. 4 | Position triangulation of the Sb donor. The colour 
map shows the probability of finding the donorinacertainlocation. 

a,b, Probability density function found using a least-squares estimation 
comparing simulated gate-to-donor coupling strengths with the 
experimentally observed strengths (see Supplementary Information 

section 7A for details, including the locations of the donor gates DFR, DFL and 
DBR). To improve on the low resolving power of the triangulation method in the 
y direction, the triangulation probability density functionis multiplied with 
the donor implantation probability density function (see Supplementary 
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Information section 7A for details). This has little effect laterally, but greatly 
confines the likely depth range of the donor within the range expected onthe 
basis of the donor implantation parameters. The most likely donor position, 
indicated by across, is at the lateral location (x, z)=(13nm, 8nm) ata depth of 
y=-5nm. The probability density functions are normalized over the model 
volume and are integrated over the out-of-plane axis in both panels, 
specifically, P(x, z) = fP(r)dy and P(y,z) =fP(r)dx. The contour lines mark the 
68% and 95% confidence regions. 
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When a magnetic impurity exists in a metal, conduction electrons forma spin cloud 
that screens the impurity spin. This basic phenomenonis called the Kondo effect’. 
Unlike electric-charge screening, the spin-screening cloud* ° occurs quantum 


coherently, forming spin-singlet entanglement with the impurity. Although the spins 
interact locally around the impurity, the Kondo cloud can theoretically spread out 
over several micrometres. The cloud has not so far been detected, and so its 

physical existence—a fundamental aspect of the Kondo effect—remains 


controversial”® 


. Here we present experimental evidence of a Kondo cloud extending 


over alength of micrometres, comparable to the theoretical length &. In our device, a 
Kondo impurity is formed ina quantum dot?” “, coupling on one side to a quasi-one- 
dimensional channel” that houses a Fabry-Pérot interferometer of various gate- 
defined lengths L exceeding one micrometre. When we sweep a voltage on the 
interferometer end gate—separated by L from the quantum dot—to induce Fabry- 
Pérot oscillations in conductance we observe oscillations in the measured Kondo 
temperature 7,, which is a signature of the Kondo cloud at distance L. When L is less 
than &, the 7, oscillation amplitude becomes larger as L becomes smaller, obeying a 
scaling function of a single parameter L/&,, whereas when L is greater than &, the 
oscillation is much weaker. Our results reveal that , isthe only length parameter 
associated with the Kondo effect, and that the cloud lies mostly within a length of &. 
Our experimental method offers a way of detecting the spatial distribution of exotic 
non-Fermi liquids formed by multiple magnetic impurities or multiple screening 


channels? ?¢ 


and of studying spin-correlated systems. 


Although Kondo physics for asingle magnetic impurity has been well 
established except for its spatial extension, our understanding of mul- 
tiple impurity systems such as Kondo lattices, spin glasses, and high- 
transition-temperature (high-7,) superconductors is far from complete. 
Insuchsystems, the Kondo cloud length (or the spatial distribution of a 
Kondo cloud) with respect to the distance between impurities and other 
length parameters is crucial for an understanding of their properties. 
The detection and control of a Kondo cloud is therefore a milestone 
in condensed matter physics. There have been attempts to detect the 
Kondo cloud for 50 years? *” +, Nuclear magnetic resonance meas- 
urements have not found any signature of the cloud’. Scanning tun- 
nelling microscopy experiments have showna signature of the Kondo 
effect but ina regiona distance away from a magnetic impurity that is 
much shorter than the cloud length”. The difficulty lies in the fact that 
measuring spin correlation in the Kondo screening requires the fast 
detection of tens of gigahertz’ and there may be complications arising 
fromthe atomicor electronic structures of the sample. However, recent 
advances in nanotechnology have opened up another way of detecting 
the Kondo cloud. We can now prepare a single spin in a quantum dot 
(QD) in contact with an electron reservoir, thus achieving systematic 
control of asingle-channel Kondo state”. The theoretical value of the 
cloud lengthis typically? & =fv,/(kg7,) =1p1m (where k, is the Boltzmann 


constant) for 7, ~1K and the Fermi velocity is v, ~ 10° ms7. A recent 
theoretical study”’ of quantum entanglement shows that the Kondo 
state lies mostly within the distance &, from the impurity, with along 
algebraically decaying tail extending farther. Interesting proposals sug- 
gest the use of afinite-size electron reservoir, to observe competition 
between the cloud length and the reservoir size”. 

Here, instead of rigidly limiting the reservoir size, we perturb the 
reservoir by inducing a weak barrier at a position L far from the Kondo 
impurity and observe the resulting change in the Kondo effect with vary- 
ing barrier position, following a recent proposal”. Figure 1a shows the 
device and measurement schemes. Anunpaired electron spin (magnetic 
impurity) is confined ina QD coupled toa one-dimensional (1D), long, 
ballistic channel”©”. The QD is inthe Coulomb blockade regime. The 1D 
channel is tuned to contain several conducting channels and has three 
quantum point contact (QPC) gates placed away from the QD at lengths 
L=1.4 1m, 3.6 pmand 6.1m. Application of voltage V,,, to one of the 
QPC gates creates a weak barrier so that a Fabry-Pérot (FP) cavity?’ of 
length L is formed between the QD and the OPC. The charging energy 
of the FP cavity is ineffective owing to strong coupling tothereservoir 
through multiple conducting channels. Changing V.,, allows us to con- 
tinuously tune the FP cavity between on- and off-resonances by altering 
the cavity length on the scale of the Fermi wavelength AL = A, (where 
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Fig. 1| Measurement setup and characterization. a, Device and measurement 
schematic. The device consists of aQD coupled to a1D channel (see Methods), 
in which three QPC gates are embedded at distances Z = 1.4 1m, 3.6 pm and 
6.1m fromthe QD. Theactivation of a QPC gate creates a FP cavity of lengthL. 
The QDis tuned viaa central plunger gate voltage V,, and two side gates V, and 
Va. The device is measured via the lock-in method: asmall a.c. voltage Vp is 
applied, andthe current /,, through the system is measured. b, The 
conductance G (measured in units of quantum conductance, e”/h) of the device 
versus the plunger gate voltage Va, and the Z = 1.4 1m QPC gate voltage Vopc. 
Coulomb blockade peaks are observed with respect to changing Vp. 
Oscillations associated with the FP cavity are seen with respect to changing 
Vopc- €, Conductance G versus Vp taken at Vop- = 0. Coulomb blockade anda 
region of enhanced conductance around Vay) =—0.50 V associated with the 
Kondo valley are clearly observed. 


A, =40 nm << L, &,) without affecting the potential profile around the 
QD. We find that changes in Vop- strongly affect the measured Kondo 
temperature 7, when the cavity lengthZ is shorter than &,.,=hu,/(k,T,..)- 
Here &,..and 7,.. are the bare theoretical cloud length and bare Kondo 
temperature defined in the absence of the QPCs or equivalently for the 
case of L =~». For L >> &.., changes of Vop- have little effect on T,. This 
implies that the Kondo state extends over about &,... 

The device is defined using top gates deposited on top of a GaAs/ 
AlGaAs two-dimensional electron gas (2DEG) wafer, with an electron 
mean free path of about 8 um (which is bigger than the device size). 
The QD population is controlled via a middle plunger gate voltage 
Vop. The coupling of the QD to the 1D channel on the right side and to 
the left side is adjusted by changing the side gate voltages V, and V,, 
respectively. The QD is coupled more strongly tothe right 1D channel 
than the left channel, so that the Kondo state is sensitive to the FP cavity 
of the 1D channel. We tune V, to change the bare cloud length &,.. and 
Kondo temperature 7,... Figure 1c shows the conductance G between 
the right 1D channel and the left lead via the QD as a function of the 
plunger gate voltage V,, when the QPCs are turned off. Several Cou- 
lomb blockade peaks are clearly visible with the measured charging 
energy of >500 pleV (ref. 7°). The Kondo effect is observed, manifest- 
ing itself in increased conductance in the valley region between two 
Coulomb blockade peaks”?"®. The effect of the QPC gates is shown in 
Fig. 1b, which plots the conductance G versus plunger gate voltage Vgp 
as well as the voltage Vop- applied to the QPC gate at Z =1.4 pm. Both 
the Coulomb blockade peaks as well as the Kondo valley undergo FP 
oscillations with respect to changing Vopc} we note that at small Vopc 
(equivalent to the first few FP oscillations) there is no effect on the 
QD energy level. The resonance level spacing A ~ 300 pleV estimated 
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Fig. 2| Influence of FP interference onthe Kondo effect.a, Conductance Gat the 
centre of the Kondo valley versus the QPC gate voltage Vop, at L = 1.4 mand device 
temperature 7. Gdecreases with increasing 7, indicative of the Kondo effect. For 
each Vopc, the Kondo temperature 7, is extracted by fitting conductance versus 
temperature to an empirical formula”. b, Plot of the extracted 7, versus Vopc 
(blue), shown alongside the conductance of the Kondo valley centre at base 
temperature G (green). 7, oscillates with respect to Vp, but in anti-phase with 
respect to conductance. The 7, oscillation amplitude is quantified by tracking the 
maximum 7x maxand minimum Ty pin Of the first oscillation. 


from the datais consistent with the cavity length / =1.4 1m, consider- 
ing that v, =2.46 x 10° ms‘. The resonance level broadening of about 
80 peV implies a weak barrier formed by Vop, (Supplementary 
Information). 

Tosee the effect of the FP oscillations onthe Kondo state, we measure 
the conductance G around the centre of the Kondo valley as a function 
of the QPC gate voltage Vo, at different temperatures 7 (Fig. 2a). The 
valley conductance decreases with increasing temperature, indica- 
tive of the Kondo state. For each value of Vop- we extract 7, around the 
valley centre by fitting conductance G versus temperature 7 to a well 
known empirical formula (see Methods)*°. This method of extract- 
ing 7, is applicable for constant density of states of the reservoirs, 
but is still applicable to our QD coupled to the FP cavity, because the 
resonance level broadening is sufficiently large or the QPC barrier is 
weak (see Methods). We find that the Kondo temperature 7, undergoes 
oscillations with respect to changing Vo», (Fig. 2b). Clearly, the Kondo 
state is affected by the perturbation ata location micrometres distant 
from the QD. The oscillation shows that the electron density of the 
FP cavity at the Fermi level that is coupled with the Kondo impurity 
differs between on and off resonances. This implies that the Kondo 
coherence is extended through the entire FP cavity, supporting the 
picture of the spatial extension of the Kondocloud. It alsoimplies that 
the resonance level spacing A is larger than the Kondo temperature, as 
theoretically expected. We note that, based on the scatterin 7, versus 
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Fig. 3| Shape of the Kondo cloud revealed from modulation of the Kondo 
temperature. Oscillation amplitude of 7, as a function of the FP cavity length L. 
The amplitude is quantified by In (7, max/7..), the maximum value Tx max of the 
oscillation normalized by the bare Kondo temperature 7,., inlogarithmic scale, 
and _Lisscaled by the bare Kondo cloud length &,... We note that 7,.. is estimated 
aS Tk. = (Tk, max2k, min)2 (see Supplementary Information). The green squares are 
obtained from the oscillation of 7, with respect to changes in the voltage Vopc 
of the QPC gate located at  =1.4 1m fromthe QD, the blue circles are for 
L=3.6um, and the magentatriangles are for  =6.1p1m. Different data points of 
thesamesymbol correspond to different 7,., (that is, different settings of V, 
and V,). The data are compared with the theoretical results of the NRG 
calculation (red crosses) and the scaling of IN (7, max/T kee) = —MIn(L/&..) with 

n= 0.47 (dashed curve). The inset shows the same dataas the oscillation 
amplitude of 7, versus the lengthZ not scaled by 7,... The different curves 
represent those having the same Ty... 


Vopc (where a smooth curve is expected), we estimate an error for the 
extracted T, of about 20%. 

The oscillations of 7, are anti-phase with those of the conductance. 
This agrees with scattering theories combined with the Fermi liquid 
and numerical renormalization group (NRG) methods when the QDis 
coupled more strongly to the FP cavity than to the left channel (Sup- 
plementary Information). In the on-resonance situations where the 
Fermi momentum k; satisfies e“" = 1, the FP cavity supports a maxi- 
mum amount of electrons at the Fermi level, and hence the Kondostate 
with maximum Kondo temperature is developed. The development 
effectively changes the resonance condition to off-resonance, result- 
ing ina minimum value of the conductance, since the electrons gain 
twice the scattering phase shift 11/2 off the Kondoimpurity inthe QD”. 
The opposite happens in the off-resonance situations of e!74! = - 1. 
We note that evidence of the Kondo scattering phase 1/2 has been 
observed”, and that the antiphase in Fig. 2b can be considered as 
further evidence. To quantify the effect of the FP cavity onthe Kondo 
state we track the maximum 7,,,,and minimum 7, ,,;, of the first oscil- 
lation of 7,. Only the first oscillation is used because the subsequent 
oscillations require a higher V.,,, which will have a stronger effect on 
the QD owing to capacitive coupling through the FP interferometer 
island. 

We now discuss the dependence of the oscillation amplitude of 7, 
onthe FP cavity length shown in the inset of Fig. 3. The amplitude is 
quantified by In(7\ max/Tx..),and we estimate th¢bare Kondo tempera- 
ture from the oscillation as Ty.. = (Tk, max 7k, min’2 (See Supplementary 
Information for a validation of the estimate); 7,..is not directly acces- 
sible because a QPC gate can form a barrier even when its voltage is 
turned off. The oscillation amplitude is small, that is, Ty max% 1-17, 
OF Tq max 1-27, min for cavity lengths Z =3.6 pm and 6.1ppmand T,..=0.39- 
0.44 K. By contrast, the amplitude becomes much larger for L =1.4 um, 
for example, Tx max 23-27. OF Tk max ~ 107k min fOr T,..= 0.31 K. For each 
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L,the amplitude becomes larger as T,.. is smaller. The result shows that 
the Kondostate is sensitive (oscillation amplitude of >30%) to the per- 
turbationat distance L < 3.6 um for 7,.,.= 0.39-0.44 K, while also sensi- 
tive at L > 3.6 um for 7,..= 0.31K. This implies that the cloud length is 
close to 3.6 um for T,.. = 0.39-0.44 K, while larger than 3.6 pm for 
Tx. = 0.31K. This finding is consistent with the bare cloud length &,.. 
estimated by the theoretical relation of &.. = fu;,/(kgT,..), that is, 
0 = 5.19 pm, 4.12 pm, 3.92 um and 3.65 um for T,..=0.31K, 0.39 K,0.41K 
and 0.44 K, respectively. 

To see the universality of the results, we plot the oscillation ampli- 
tude of T, versus the cavity length L scaled by the bare cloud length 
&,..in Fig. 3. Since we analyse only the first oscillation, the transmission 
through the QPC is almost independent of V,,, and common for all 
QPCs. We find that all data points fall onto a single curve, as theoreti- 
cally expected for a fixed transmission through the QPC. This scaling 
result is the evidence that &, is the only length parameter associated 
with the Kondo effect. For L 2 ,.., the Kondo state is little affected by 
the perturbation at distanceL, because the maximum of the oscillation 
is 20% larger than the minimum, 7x max 1.27 mine ASL decreases below 
&,... the oscillation amplitude becomes very much larger, showing that 
the maximum is 1,000% larger than the minimum (Tx max LOT min) at 
L=0.1&,.. (we note that the oscillation amplitude is much bigger than 
the 20% error in the extracted 7,,). The increase follows the universal 
scaling” of IN(T x max/Tx..) =—q|n(L/&,..) with a constant 7 defined as the 
modulation of the density of states set by the QPC pinch-off strength. 
The plotis in good agreement with theoretical NRG calculations based 
on realistic parameters estimated from sample characterization (Sup- 
plementary Information). The result is consistent with the theoretical 
result” of the spatial distribution of the Kondo singlet entanglement 
that the main body of the Kondo cloud lies inside the length &,.. with 
along tail extending beyond €,... An equivalent, alternative picture is 
that foraFP cavity of length, there are L/é,.. (approximately k,7,../A) 
localized single-particle states of size ,..in a row. When L/€,.. >1, 
the single-particle state located closest to the Kondoimpurity forms 
the main body of the cloud and is wholly within L. Hence, the local 
perturbation at L affects only the other single-particle states contrib- 
uting to the cloud tail. When L/€,.. <1, the main body of the cloud 
extends beyond distance L and hence it is strongly affected by the 
perturbation. 

Our result provides evidence of the spatial distribution of the Kondo 
state over micrometres. It will be interesting to study the spatial distri- 
bution further, for example, by engineering the spatial spin screening 
and the entanglement of the Kondo state. For example, by applying 
large QPC gate voltage to our device, we could systematically study 
the screening cloud of aKondo box”. It willalso be valuable to study 
the spin screening by multiple independent channels as in the multi- 
channel Kondoeffects or ina situation of multiple impurities as inthe 
two-impurity Kondo effects, because those effects are accompanied 
by non-Fermiliquids anda quantum phasetransition®™. Our strategy 
of detecting spin screening by applying a weak electrostatic gate at a 
position distant from an impurity spinis applicable to the realization>* 
of these effects with systematic control. 

Our work is an initial measurement of a Kondo cloud. It may enable 
Kondo cloud detection in other systems and in other ways, suchas by 
measuring spin-spincorrelations. The universality implies that Kondo 
clouds in aconventional metal or inthe mixed-valence regime have a 
spatial distribution similar to that found here, albeit with a shorter 
&,. Ballistic electron transport over &, will be required to apply our 
approach. 
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Methods 


Sample preparation 

The system of the QD coupled to the quasi-1D FP resonant cavity 
was fabricated in a 2DEG (carrier density n = 3.12 x 10" cm, mobil- 
ity = 0.86 x 10° cm’ V's“) heterointerface using a standard surface 
Schottky gate technique. 


Measurement 

Measurements were performed inan Oxford Instruments MX100 dilu- 
tion refrigerator with a base lattice temperature of 40 mK. The base 
electron temperature was measured to be around 80 mK. (Electron 
temperature versus measured mixing chamber temperature was cali- 
brated via analysis of the QD Coulomb blockade peak width.) The quasi- 
1D channels were formed by applying an approximately —0.45 V (-1V) 
signal to the gates defining the bottom (top) boundary ofthe channel. 
(This voltage was sufficient to fully deplete the carriers below the gates, 
isolating the channel from the rest of the 2DEG.) Electron transport was 
measured using the lock-in method. An a.c. voltage oscillation with 
a d.c. offset was applied to a sample via a divider with the a.c. excita- 
tion set to 3-15 pV. The d.c. offset was set so as to achieve zero bias at 
the sample. (The d.c. offset was varied in order to calculate the QD 
charging energy.) Current through the sample was measured using a 
custom current sense amplifier with a current-sense resistor of 10 kO 
mounted onthe mixing chamber. Gate voltages were controlled viaa 
custom digital-to-analog converter (DAC). Temperature of the device 
was changed globally via a heating coil at the mixing chamber. 

We provide additional details about the gate voltage settings (see 
Fig. 1a). The voltage applied to the long global gate defining the upper 
boundary of the 1D channels and the QD was fixed to about -1.0 V for 
the entire experiment. The gate voltages defining the left and right 1D 
channels from the lower side were set to about 0.45 V, fully depleting 
the carriers underneath the gates but leaving several conducting chan- 
nels in each 1D wire. The gate voltages defining the QD were operated 
around —0.5 V, being varied in order to achieve different values of T,... 
The voltages on the tunnel coupling gates V, , were set in a range of 
-0.45 V to —0.6 V, while the middle plunger gate voltage V,p was swept 
between —0.4 Vand —-0.7 V. The QPC gates were set to zero voltage when 
not operated. We confirmed that Vop¢.4 aNd Vopcs¢) have no influence on 
the conductance through longer FP cavities defined by Vop¢i3,6) OF Vopcie.+ 


Calibration of measured sample temperature 

For temperatures below about 500 mK it becomes increasingly difficult 
to thermalize hot electrons sent tothe sample fromthe measurement 
electronics. In this measurement setup, the hot electrons were cooled 
at the mixing chamber via a thermalization coil as well as via copper 
powder filters. Nevertheless, calibration was required to match the 
measured mixing chamber lattice temperature to the actual electron 
temperature inthe sample. This was achieved by measuring the shape 
of the Coulomb blockade peaks of the QD with all the QPCs turned off 
(Extended Data Fig. 1c). Picking a peak that does not feature any Kondo 
temperature, we should expect that the conductance G(6V,) (where 
6V, is the shift in plunger gate voltage away from the Coulomb blockade 


(a6Vo) Jiret. 2°) Theconstant 
DegT 


ais independent of electron temperature and should depend only on 
the sample geometry. (Extended Data Fig. 1a shows the fit of G versus 
gate voltage V, taken at ameasured temperature Of Tyreasurea= 300 mK). 
Indeed, above Treasured > 6OO mK we see that a has little to no variation, 
so the mixing chamber thermometer temperature and the electron 
temperature are the same Teriiprated = / measured: BY looking at how the con- 
stant a evolves for Treasured < ©0OO MK we can now build the calibration 
for the electron temperature Tepjiprated = (7 measured) 2S Shown in Extended 
Data Fig. lb. Throughout the text, the temperature 7 refers to the cali- 
brated temperature Tyajprated: 


peak centre) will be proportional to cosh” ( 


QPC barrier strengths 

The strengths of the oscillations in 7, ina,/ 7.0. depends on the QPC pinch- 
off strength a. Here we define a=1 - (t,/t)” with t, being the hopping 
energy across the QPC and tbeing the hopping energy along the 1D 
channel. (a=0 means there is no pinchoff present due to the QPC, and 
a=1means that the QPC fully decouples the FP cavity from the rest 
of the wire. See the Hamiltonian in the Supplementary Information.) 
Belowwe estimate ain three different ways, whichshowa = 0.1. Fitting 
the main results (Fig. 3) to the NRG calculations, we arrive at a= 0.1. 
In addition, we find that a can be obtained from the ratio of on- and 
off-resonance conductances of the Kondo valley at T=0 as follows: 
a@=1- (Gy min/Gomax)°®. The conductances at zero temperature can be 
extracted similarly to 7, by fitting to the empirical formula*’ (see below 
for fitting details). Averaging the data over several values of V, and 
V, we arrive at a= 0.1. Finally, we estimate a also by considering the 
behaviour of the system away from the Kondo regime. The strength is 
proportional to the fluctuations in the local carrier density p insidethe 
FP cavity: @=1-(Pmin/Pmax)??, WHEL Pmmaximin is the maximum (minimum) 
value of the fluctuations in p. In turn, the effective coupling strength 
[of the QD tothe FP cavity is proportional to the local carrier density 
p. The coupling strength /, directly affects the width of the Coulomb 
blockade peak with respect to gate voltage V,. Indeed, as we tune the 
FP resonance by changing the QPC gate voltage Vop,-, the Coulomb 
blockade peak undergoes fluctuations in width in synchrony with 
fluctuations in conductance (Extended Data Fig. 2). Taking the first 
oscillation, we arrive at the QPC pinchoff strength a =1-(Wrin/Wonax) >” 
= 0.1, where Wis the width of the oscillation. 

Forthe case of ballistic electrontransport and asimilar a for all three 
QPC gates, one would expect the amplitude of the FP oscillations in 
Coulomb-blockade peak conductance to be independent of the QPC 
gates at different distances. Indeed, when comparing the oscillations 
for the same settings of V, and V,, but activating QPC gates at the three 
different distances, we see that the amplitudes are within 15% of each 
other (Extended Data Table 1). 


Kondotemperature estimation 
The Kondo temperature 7, is estimated by fitting the experimental 
data to the empirical formula”? of 


12. 
a(n) =64| ath us ) 


2 12 
+ Tk 


with Tj = 7/2 - 1, where the zero-temperature conductance G, and 
the exponent sare fitting parameters; according to ref. °°, the exponent 
s should not vary much from 0.22; in this sense, the fitting has only two 
fitting parameters, 7, and Go. This estimation works well when the 
density of states of the reservoirs coupled to a Kondo impurity is 
approximately energy-independent near the Fermi level.In our setup, 
in which the Kondo impurity in the QD is coupled to the FP cavity, the 
density of states is energy-dependent and the estimation is applicable 
when the QPC barrier defining the FP cavity is so weak that the energy 
dependence is not crucial. The applicability of the estimation is con- 
firmed by our NRG calculation with the model parameters chosen from 
the experimental data (Supplementary Information). 

An example of the fit of our data to the empirical formula is shown 
in Extended Data Fig. 3, with 7, and G,as fitting parameters (Extended 
Data Table 2). (On average, the fitting function returned a degrees-of- 
freedom adjusted R’ value of 0.995 for all the values of Vop- within the 
first oscillation cycle.) The trend of Gversus Tis described well by the 
empirical formula with s = 0.22 + 0.01 (Extended Data Table 2).The fit 
is done with the temperature dependence of the conductance G over 
an electron temperature window of 0.1-0.5 K. In usual cases (namely, 
in the absence of the FP cavity), the conductance G measured within 
the temperature window [0.57,, 1.57,] around the 7, is sufficient to 


correctly estimate the Kondo temperature from the fit; this can be 
confirmed by using NRG calculations or by analysing the empirical 
formula. It is because this window can capture the Kondo crossover 
(from the Kondo fixed point to the local moment fixed point). Even with 
the FP cavity, this estimation is expected to work wellinthe regime of 
L<04€,.,according to our NRG calculation (Supplementary Informa- 
tion). The estimation becomes worse as increases (>0.4€,..), but not 
enough to affect our conclusion in the main text and Fig. 3. 

We remark two points. First, while the fitting function implies a good 
fit, itis difficult to estimate the error in the fitted 7, froma single fit. 
Instead, as discussed in the main text, we obtain an error by looking 
at the scatter of 7, when the QPC voltage is changed. In an ideal case, 
the Kondotemperature is expected to evolve smoothly with QPC volt- 
age. That is, for asmall change in voltage AVop- we expect only a small 
deviation in temperature 7,(Vopc) = T(Vopc + AVopc). Large changes in 
T, for small changes in Vp, are therefore attributed to error. Using this 
method, we arrive at an error in 7, of about 20%. 

Second, the fitting parameter s = 0.22 implies that our experiment 
is inthe Kondo regime rather than in the mixed-valence regime. The 
empirical formula provides a good fit with its parameter s=0.22inthe 
Kondo regime, while the parameter deviates rapidly from 0.22 in the 
mixed-valence regime*°. 

We note that the estimation of 7, directly from the experimental data 
is a merit of our experimental regime; it is unclear how to estimate 7, 
directly inthe opposite regime where the QPC barrier is strong (corre- 
sponding to afinite-size reservoir coupled toa QD” ). This is because 
the temperature dependence of the conductance Ghas a nontrivial 
feature due to the FP resonance when the QPC barrier is strong (the 
nontrivial feature is enhanced as the barrier becomes stronger, and 
the feature is not captured by the empirical formula), as shown in our 
NRG calculations. 


Modelling 

To model our experiments, we theoretically study an Andersonimpurity 
formed ina QD coupled to two1D leads, one of which houses a FP cavity. 
The Hamiltonian of the model is found inthe Supplementary Informa- 
tion. The parameters of the model are estimated from the experimental 
data. The QD spectral function is obtained by using the NRG method, 
and thetemperature dependence of the conductance is computed! by 
combining the spectral function and the Fermi-Dirac distribution func- 
tion. The feature in Fig. 2 (that the oscillations of 7, are anti-phase with 
those of the conductance) is reproduced in our theory in two different 
ways. The feature is obtained by using the scattering-matrix formal- 
ism combined withthe Fermi liquid theory and by taking into account 
the scattering phase shift 1/2 off the Kondo impurity. The anti-phase 
feature is also found in an independent way based on the NRG method 
and conductance calculation. Onthe other hand, the NRG result of the 
Kondo temperature 7, in Fig. 3 is obtained, in the same way as for the 
experimental estimation of 7, by fitting the conductance obtained 
by the NRG method to the empirical formula. The universal scaling 
behaviour in Fig. 3 is obtained by using the Poor Man’s Scaling” and it 
is confirmed by NRG calculations; the Poor Man’s Scaling and the NRG 
give the same scaling form but with different values of the parameter 


n, as n depends onthe estimation method of 7,. The details are found 
inthe Supplementary Information. 


Universal scaling at L < &.. 
The experimental data in Fig. 3 show the behaviour of 


INT max/ Tk) a= I} In(L/& 7») (1) 


atL <&,... This behaviour is scaled only by 7,.. and &.., So it is a universal 
feature characterizing the core region (hu,/U<L<&,..) ofthe Kondo cloud. 
Here Uis the QD charging energy (see Supplementary Information). 

We derive the behaviour. In our experimental regime of t, << t, and 
small a, the Poor Man’s Scaling” leads to In( Tx max/ Tk.) * ~aln(L/&..)- 
Here ¢, (¢,) is the tunnelling amplitude between the QD and the left 
channel (the FP cavity). See Supplementary Information. In this esti- 
mation based on the Poor Man’s Scaling, 7 equals a. This implies that 
the coefficient 7 is determined mainly by the resonance broadening 
parameter a for small a. The NRG calculation confirms the universal 
scaling in equation. (1), but with 7 # a. The value of n depends on the 
estimation method of the Kondo temperature; the Kondo temperature 
is defined inthe Poor Man’s Scaling in a way that is different from that 
in the NRG method. We find that the scaling behaviour also occurs in 
the other regime of ¢,, and a including the regimes of large a. We also 
note that at L > &,.., the Kondo cloud has a long tail following another 
algebraic scaling law characterized by quantum entanglement or elec- 
tron conductance*”. 


Data availability 
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Extended Data Fig. 1| Calibration of measured sample temperature. 

a, Conductance versus QD gate voltage V, taken at a temperature of 300 mK. 
The grey lines show the fit of the data to the theoretical lineshape fora 
Coulomb blockade peak. b, The electrontemperature in our device Teatitrated 
versus the lattice temperature measured by athermometer located at the 
mixing chamber Theasurea: €, Conductance of the Coulomb blockade peaks 
versus gate voltage (around the peak centre) 6V, shown for several values of the 
measured temperature. (a.u., arbitrary units). 
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Extended Data Fig. 2 | Full width at halfmaximum ofa Coulomb blockade 
peak versus QPC gate voltage V,,. Data are shown for a Coulomb blockade 
peak away from the Kondo valley. Theinset shows conductance Gthrough 

the QDas a function of (shifted) gate voltage 6V, and QPC gate voltage Vopc. 
Oscillations of both peak conductance and peak width with respect to Vop- are 
clearly observed. 
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Extended Data Fig. 3 | Plot of conductance versus temperature at the centre temperature was observed. Green data points correspond to datatakenata 
of the Kondo valley. The lines are fits to the empirical formula. a, The datain midpoint beween the red and blue datasets. b, Data at V.p- = 0 V, for several QD 
red correspond to Vopc, at which the minimum Kondo temperature was settings corresponding to different L/&... 

observed. The data in blue correspond to Vop,, at which the maximum Kondo 


Extended Data Table 1 | Coulomb-blockade peak FP 
oscillation amplitude 


QPC distance LZ Oscillation amplitude AG 


1.4 um 0.080 e7/h 
3.6 um 0.073 e?/h 
6.1 um 0.071 e?/h 


The amplitude of FP oscillations in Coulomb-blockade peak conductance versus distance to 
the QPC gates. 
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Extended Data Table 2 | Fitting parameters with upper and lower bounds 


Fitting Parameter Lower Bound Upper Bound 


Kondo Temperature Tx OK 1.2K 
Conductance at T = 0 Go 0 e7/h 


Power Law Factor s 0.21 0.23 
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® Check for updates 


Spin-based logic architectures provide nonvolatile data retention, near-zero leakage, 
and scalability, extending the technology roadmap beyond complementary metal-— 
oxide-semiconductor logic 


113’ Architectures based on magnetic domain walls take 


advantage of the fast motion, high density, non-volatility and flexible design of 
domain walls to process and store information’*"* "*. Such schemes, however, rely on 
domain-wall manipulation and clocking using an external magnetic field, which limits 
their implementation in dense, large-scale chips. Here we demonstrate a method for 
performing all-electric logic operations and cascading using domain-wall racetracks. 
We exploit the chiral coupling between neighbouring magnetic domains induced by 


the interfacial Dzyaloshinskii—Moriya interaction 


1720 which promotes non-collinear 


spin alignment, to realize a domain-wall inverter, the essential basic building block in 
allimplementations of Boolean logic. We then fabricate reconfigurable NAND and 
NOR logic gates, and perform operations with current-induced domain-wall motion. 
Finally, we cascade several NAND gates to build XOR and full adder gates, 
demonstrating electrical control of magnetic data and device interconnection in logic 
circuits. Our work provides a viable platform for scalable all-electric magnetic logic, 
paving the way for memory-in-logic applications. 


Our concept for chiral magnetic domain-wall (DW) logic takes advan- 
tage of the efficiency and speed of magnetic DW motion induced 
by spin-orbit torques (SOTs)'*”!”’ and exploits the chiral coupling 
between adjacent magnets with competing magnetic anisotropy and 
interfacial Dzyaloshinskii-Moriya interaction (DMI)” ~° (Fig. 1a). Based 
onthis coupling, we demonstrate thatit is possible to invert a DW using 
an electric current, namely, to transform an up/down (©|@) DWintoa 
down/up (@|©) DW or vice versa. Here, the magnetization directions 
of @ and © in the racetrack represent the Boolean logical values ‘1’ and 
‘0’, respectively. In Fig. 1b, we illustrate the design of a DW inverter, 
which consists of an in-plane (IP) magnetized region embedded ina 
racetrack with out-of-plane (OOP) magnetization. As the two OOP- 
magnetization regions (hereafter, ‘OOP regions’) on either side of the 
IP-magnetization region (hereafter, ‘IP region’) are coupled by the 
DMI, the reversal of one OOP region induces the reversal of the other, 
leading to the inversion of a DW travelling along the racetrack, which 
is equivalent to a NOT gate. 

To demonstrate the operation of the DW inverter, we fabricated a set 
of OOP magnetic Pt/Co/AIO, nanowires with 50-nm-wide V-shaped IP 
regions patterned using a selective oxidation process (Fig. 2a, Extended 
Data Fig. 1). The DW motion is driven by an electric current (Fig. 2b) 
and tracked with polar magneto-optic Kerr effect (MOKE) microscopy. 
Starting from the initial down-right-up magnetization configuration 
of the OOP-IP-OOP structure (@ > ©), an ©|® DWis injected from the 
left OOP region (Fig. 2c). By applying asequence of current pulses, the 
©|@ DW moves in the direction of the current towards the IP region, 
as expected for a left-handed chiral Néel DW??*, When the ©|@ DW 
encounters the IP region, the IP magnetization switches, going from 


Fig. 1| Chiral coupling between adjacent nanomagnets and current-driven DW 
inversion. a, Schematic of magnetic chiral coupling induced by the interfacial 
DMI. After selective oxidization, the magnetizations of neighbouring OOP 
(oxidized; red-shaded) and IP (unoxidized; blue-shaded) regions align with a left- 
handed chirality in Pt/Co/AIO,. b, Schematic showing current-driven DW 
inversion, which occurs as the DW transfers across the IP region. The white- 
shaded region is the DW, and the directions of the effective field induced by the 
SOTs, Hsor, and the DW velocity, vp, are indicated with arrows. In Pt/Co/AIO,, 
both©|@ and @|© DWs move in the same direction as the electric current/. 
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Fig. 2|Current-driven DW inverter.a, Schematics of a NOT gate and current- 
driven DW inverter. Red- and blue-shaded regions indicate regions with OOP 
and IP anisotropies, respectively. b, Coloured scanning electron microscope 
(SEM) image of seven parallel DW inverters in a three-dimensional rendering of 
the DW measurement setup. The direction of the current pulses is indicated. 
c, MOKE image sequence of DW inversion for a DW incident from the left with 
an ©|@ configuration. The edges of the magnetic racetracks are indicated by 
red dashed lines and the positions of the inverters are shown by white lines. 
The bright and dark regionsin the racetracks in the MOKE images correspond 
to©and @ magnetization, respectively. The entire image sequence for seven 
inverters is shownin Supplementary Videol1.d, X-ray magnetic circular 
dichroism (XMCD) image sequence of DW inversion for anincident DW with an 
©|@ configuration measured by STXM. Eachimage is captured after the 


®>© to @< ©, accompanied by the annihilation of the DWto the left 
of the IP region and nucleation of a new DW with opposite polarity to 
the right of the IP region. Insight into the microscopic mechanism of 
the DW inversion is provided by the combination of scanning transmis- 
sion X-ray microscopy (STXM) and micromagnetic simulations (Fig. 2d, 
Supplementary Figs. 1, 2). As the incident ©|@ DW approaches the IP 
region, it is compressed against the IP region by the SOTs, increasing 
boththe magnetostatic and exchange energies. The resulting compact, 
high-energy spin texture” can only be unwound by annihilating the 
incident DW and switching the IP magnetization with the help of SOTs. 
Upon switching of the IP magnetization from > to <,a @ domain nucle- 
ates on the right side of the IP region because of chiral coupling. This 
process is promoted at the tip of the V-shaped inverter owing to the 
additive contribution of chiral coupling from both sides of the V-shaped 
region. Therefore, the optimized design of the narrow V-shaped IP 
region facilitates the switching of the IP magnetization and the nuclea- 
tion of anew domain (Extended Data Fig. 2). As aresult, the ©|@ DWis 
effectively transmitted through the IP region and transformed into a 


@|© 


application of onecurrent pulse. The bright and dark regions inthe XMCD 
images correspond to© and @ magnetization, respectively. Micromagnetic 
simulations of the inversion process are shown on the right of each image, with 
the direction of the magnetization indicated by the colour wheel. The entire 
image sequence is shown in Supplementary Video 2.e, MOKEimages showing 
the inversion of a@ domain driven across the IP region with current pulses. The 
entire image sequence is shown in Supplementary Video 3. The current density 
and duration of the pulses in cand eare 7.5 10" A mand 50 ns, whereasin 
dthey are1.1x10"Am7and Ins. The scale bars are 3 um in the MOKE images 
and 500 nminthe XMCD images with simulations. In c-e, the number of 
current pulses applied before the acquisition of each image is indicated at the 
bottom right. 


@|© DW. An analogous inversion process occurs for an incident @|© 
DW (see Extended Data Fig. 3), so that the inverter effectively reverses 
the magnetization of domains travelling across the IP region, as shown 
in Fig. 2e. By using an electric current, it is possible to invert notonlya 
single DW, but also a sequence of DWs, and consequently a sequence 
of domains that propagate along a racetrack. This is a unique feature 
of chirally coupled nanomagnetic structures. 

Building on the principles used to construct the NOT gate, we now 
demonstrate howto realize a reconfigurable NAND/NOR gate. This 
gate makes our concept for current-driven DW logic functionally com- 
plete, given that any Boolean function can be implemented using a 
combination of NAND or NOR gates. The core structure of this gate 
(indicated by the dashed red circle in Fig. 3a) is composed of four OOP 
regions (Fig. 3b, in red) that form two logic inputs, one bias and one 
logic output connected via IP regions (in blue). To illustrate the func- 
tionality of the NAND gate, we fabricated four devices with the same 
core structure and different logic-input configurations (Fig. 3d, e). 
For each device, two DW reservoirs are connected to the inputs via 
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Fig. 3 |Reconfigurable NAND/NOR logic gates. a, Coloured SEMimage ofa 
reconfigurable NAND/NOR logic gate (right) and corresponding logic-circuit 
(left). Red, blue and yellow colours indicate regions with OOP and IP magnetic 
anisotropies and the Pt strip, respectively. b, Schematics showing the 
relationship between the magnetization inthe two logic inputs, the bias 

and the logic output for the NAND gate. c, Truthtable for the reconfigurable 
NAND and NOR logic operations. d, e, MFM images of different logic 
operations for inputs of ‘11’, ‘10’, ‘01’ and ‘00’ with a bias of ‘0’ (d) and ‘1’ (e) and 
the corresponding logic-circuit diagrams. The DW reservoirs (circular pads at 
the top of each image) and the bias are set to logical value ‘O’ (d) and ‘I’ (e) by 
applying an OOP magnetic field of 1 kOe. f, MOKE image sequence and 
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corresponding schematics showing the operation of asingle NAND gate 
witha sequence of logic inputs driven by an electric current. The initial 
magnetizationis set tobe © by applying an OOP magnetic field of 1kOe. The 
greenand purple areas inthe schematicscorrespond to© and@ 
magnetization, respectively. The boundaries of the logic gate are indicated by 
red dashed lines and the positions of the inverters are indicated by white solid 
lines in the MOKEimages. The direction and the number of current pulses 
applied before each MOKE image are indicated (current density, 7.5 x10" Am”; 
pulse length, 30 ns). The entire image sequenceis shownin Supplementary 
Video5. The bright and dark areas in the device regions ind-fcorrespond 

to© and @ magnetization, respectively. All the scale bars are1pim. 


Current 
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Fig. 4| Electrical control of data flow and cascaded DW logic circuits. a, Data- 
flow switch through a Y-shaped structure, initialized so that the magnetization 
points @ everywhere within the device. Left, coloured SEM image of the device 
and corresponding logic-circuit diagram. Right, MFM images of the magnetic 
configuration when current flows through the upper racetrack (top) and 

then the lower racetrack (bottom). b, Sequence of MOKE images illustrating 
electrical control of dataflow throughacross structure. The direction and 
number of current pulses are indicated (current density 9 x10" Am”, 

pulse length 30 ns). A coloured SEM image of the cross structure and the 
corresponding logic-circuit diagram are shown at the top. The entire image 
sequenceis shown in Supplementary Video 6. c, XOR gate fabricated by 


racetracks. The four logic-input configurations of ‘11’, ‘10’, ‘Ol’ and ‘00’ 
areachieved by placing inverters after some of the DW reservoirs. The 
two DW reservoirs and the bias are set to logical value ‘O’ by applying an 
OOP magnetic field of 1 kOe. By applying current pulses, the @ magnetic 
domains propagate from the DW reservoirs with inverters (logic inputs 
‘Y), whereas the magnetization stays the same (©) for the racetracks 
connected to the DW reservoirs without inverters (logic inputs ‘0’). 
Asaresult of chiral coupling”°, the output magnetization depends on 
the relative alignments of the inputs and the bias (Fig. 3b, c), which is 
analogous to a majority gate**’. Therefore, for an © bias, the output 
magnetization switches to © only when both of the input magnetiza- 
tions are @; otherwise, the output magnetization is @. As shownin the 
magnetic force microscopy (MFM) images in Fig. 3d, the magnetization 
direction of the logic output is © (‘0’) for logic inputs of ‘11’ and @ (‘1’) 
for logicinputs of ‘10’, ‘Ol and ‘00’. This relationship between the logic 
inputs and the output corresponds to the required logic operations for 
a NAND gate (Fig. 3c). By changing the orientation of the bias to @, as 
shown in Fig. 3e, we can reconfigure the NAND gate into a NOR gate. 
Inthe latter case, the output magnetization is @ only when both of the 
input magnetizations are ©. Hence, our logic gatecan be reconfigured 
between NAND and NOR by switching the biasterminal, enabling rapid 
logic reprogramming during run time?”®. 


cascading four NAND gates. Left, coloured SEM image of the XOR logic gate 
and corresponding logic-circuit diagram. Right, MFM images of the XOR logic 
gate with logic inputs of ‘11’, ‘10’, ‘01’ and‘00’. The DW reservoirs and the bias 
are set to logical value ‘0’ by applying an OOP magnetic field of 1 kOe. Red, blue 
and yellow colours in the SEM images indicate the regions with OOP and IP 
magnetization and the Pt strips, respectively. d, Full adder gate. Top, logic- 
circuit diagram of the fulladder with input operands of a=‘0’, b=‘1' andacarry 
bit of c,,=‘1’. Bottom, MFM image of the full adder magnetic circuit. The bright 
and dark areas in the device regions in b—d correspondto © and @ 
magnetization, respectively. All the scale bars arel pm. 


We now demonstrate the operation ofa single NAND gate using cur- 
rent-driven DW motion to provide aseries of different logic inputsto the 
same gate over time (Fig. 3f). In this device, we place three DW inverters 
in the left racetrack and two DW inverters in the right racetrack. This 
means that asequence of current pulses will generate the sequence of 
logic inputs: ‘OO’-‘11’-‘01’-‘11’-‘10’. The corresponding logic outputs 
are then ‘I’-‘0’-‘1’-‘0’-'1 over time. For each operation, the DWs that 
give the twoinputs may notarriveat the gate at the same time. We can 
mitigate this by introducing a sufficient propagation delay time (see 
Methods). Ina real device, this can be achieved by clocking the electric 
current. Furthermore, it would be possible to control the logic inputs 
and bias terminals with magnetic tunnel junctions (MTJs) fabricated on 
the magnetic racetracks, which can also be used for electrical readout 
of the outputs”? (see Methods). 

In addition to forming a complete logic set, chiral DW racetracks 
fulfil three additional requirements for practical implementation in 
logic circuits, namely, input selectivity, data crossover and cascading 
of different logic gates. Using current-driven DW propagationthrough 
the Y-shaped structure in Fig. 4a, it is possible to electrically select 
the logic input. A simple cross structure allows DWs to propagate in 
orthogonal racetracks (Fig. 4b, Extended Data Fig. 4), which simplifies 
the design of crossovers, avoiding the complexity of metal bridges used 
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in conventional charge-based circuits. Moreover, because our logic 
inputs and outputs are based onthe same physical phenomena, several 
logic gates can be cascaded directly, without the need for additional 
transducers between magnetic and electric signals (Extended Data 
Fig. 5). As examples, we present in Fig. 4c a binary half adder created 
by cascading four NAND gates to form an XOR gate, and show the full 
adder operation by cascading 15 NAND gates in Fig. 4d. This circuit 
also demonstrates the possibility to fan outa single output that canbe 
used to drive the input of the next logic gates. A remaining challenge 
isto create magnetic logic circuits with feedback loops. This could be 
realized by using either an external electrical circuit to read the output 
and write this back to the input with MTJs, or an additional racetrack 
with an inverted current direction to drive DWs from the output back 
to the input. 

For device applications, the scalability and efficiency of magnetic 
DW logic circuits should also be addressed. Because the chiral coupling 
induced bythe DMI is effective at the scale of the magnetic moments, 
it shouldbe possible to reduce the size of the logic gates downtoa few 
nanometres by using advanced lithography techniques. The speed of 
the logic operations is relatedtothe DW velocity, whichcan reach several 
hundreds of metres per second for chiral DWs driven by SOTs”**. 
The operation time can be estimated from the time required for a 
DW to transfer across the gate, which can be as short as a few tens of 
picoseconds for an inverter scaled down to 10 x 10 nm? (see Methods). 
The energy consumption of a single NOT operation in our 0.8 x 1-um? 
racetracks is about 20 pJ, which would scale down to less than 20 aJ 
in structures with a footprint of 10 x 10 nm? (see Methods). The non- 
volatility of the magnetic inputs and outputs provides further energy 
savings because magnetic DW logic circuits do not consume power 
when idle and do not need reloading of data after power-off. These 
features make all-electric magnetic DW logic attractive for use in low- 
power, ‘instant-on’ microelectronic processors that are ubiquitous in 
modern-day electronics. 
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Methods 


Device fabrication 

The magnetic films were deposited on a 200-nm-thick SiN, layer ona 
silicon substrate using d.c. magnetron sputtering at a base pressure 
of <2 x10’ torr and a deposition Ar pressure of 3 mtorr, and pattern- 
ing was carried out by electron-beam lithography. Continuous films 
of Pt (S nm)/Co (1.6 nm)/AI (2 nm) were milled into strips with Ar ions 
through a negative resist (ma-N2401) mask. In these magnetic strips, 
the upper Co/Al bilayer was milled through a high-resolution posi- 
tive resist (poly(methyl methacrylate), PMMA) mask to create the DW 
racetracks and logic devices. To define the IP region in these magnetic 
structures, asecond PMMA mask was patterned by electron-beam 
lithography on top of the Al layer. Using a low-power (30 W) oxygen 
plasma at an oxygen pressure of 10 mtorr, the unprotected Al layer 
was oxidized to induce perpendicular magnetic anisotropy in the Co 
layer. Finally, electrodes of Cr (5nm)/Au (50 nm) were fabricated using 
electron-beam lithography combining electron-beam evaporation with 
a lift-off process. The main steps of the device fabrication are shown 
in Extended Data Fig. 1a. 

The different anisotropies, with OOP regions (exposed to oxygen 
plasma) and IP regions (protected by the PMMA mask), were confirmed 
by polar MOKE measurements (Extended Data Fig. 1b). The effective OOP 
magnetic anisotropy field was 3.94 kOe, as obtained from anomalous 
Hall effect measurements with an applied IP magnetic field (Extended 
Data Fig. 1c). The interfacial DMI constant D was estimated to be 
-0.9+0.1mJ m7” (1s.d.) by measuring the DMI-induced chiral coupling”. 


Electrical measurement configuration 

The magnetic DW motion and logic operation are driven by current 
pulses generated with an HP Agilent 8114A high-voltage pulse gen- 
erator and an AVTECH ultrahigh-speed pulse generator. The pulse 
generators can provide pulses of variable voltage and pulse width. 
The current densities are calculated by dividing the nominal voltage 
by the device resistance and cross-sectional area, and are indicated for 
each operation. The directions of the current pulses for each device 
are summarized in Extended Data Fig. 6. 


MFM measurements 

The MFM measurements were performed using a Bruker Dimension 
Icon Scanning Station mounted ona vibration- and sound-isolation 
table using tips coated with CoCr. To minimize the influence of the stray 
field from the MFM tip during the measurements, a thin PMMA layer 
(-20 nm thick) was spin-coated on the samples to increase the distance 
between the tip andthe magnetic film. The MFM images of the devices 
were captured after saturation with an OOP magnetic field to set the 
initial magnetization direction in all ofthe reservoirs, followed by cur- 
rent pulses applied in the Pt strips to obtain the final states. 


MOKE microscopy measurements 

The MOKE images were recorded using a custom-built wide-field MOKE 
microscope. A background image was captured after the application 
of alarge positive OOP magnetic field of 1kOe. The background image 
was subtracted from the subsequent images to achieve differential 
images with magnetic contrast. To prepare the initial state of the DWs 
shown in Fig. 2c, the racetrack was first saturated with an OOP magnetic 
field. The magnetic field was removed, leaving the racetrack with OOP 
magnetization, with a small area of reversed magnetization enclosed 
by the V-shaped IP region that results from the chiral coupling”. Then, 
current pulses were applied in the opposite direction to that shownin 
Fig. 2b tocreate a single DW onthe left side of the DW inverter (Fig. 2c). 


STXM measurements 


The magnetic configuration of the DW inverter was imaged using STXM 
at the PolLux beamline of the Swiss Light Source. The magnetization 


state was probed using XMCD at the Co L, absorption edge at normal 
incidence. The devices measured using STXM were fabricated on X-ray- 
transparent SiN, membranes. 


Micromagnetic simulations 

To understand the mechanism of the DW inversion, micromagnetic 
simulations were carried out with the MuMax3 code*’ using acomputa- 
tion box containing 2,048 x 1,024 x 1cells with 2 x 2 x 1.6 nm? discretiza- 
tion and the following magnetic parameters: saturation magnetization 
M,=0.9 MA‘, effective OOP anisotropy field H.=150 mT, exchange 
constant A =16 pJ m“, spin Hall angle of Pt 0,,= 0.1 and interfacial DMI 
constant D=~-1.5 mJ m”’. 


Mechanism for DW inversion 

To elucidate the basic mechanism behind DW inversion in an OOP- 
IP-OOP structure, we consider a simple model. The DW inversion pro- 
cess can be explained in terms of the effective DMI field generated in 
non-collinear magnets, where the DMI vector lies in the plane of the 
magnetic thin film. This effective DMI field is given by: 


_ 2D (dm, . dm, 
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where yp, is the vacuum permeability, and m, and m, are the zand x 
components of the magnetization, respectively. We can then consider 
a situation in which an ©|@ DWis driven by SOTs towards the IP-mag- 
netized region, as shown in Supplementary Fig. la. At equilibrium, the 
IP-magnetized region, together with the surrounding domains, forms 
a@ > ©configuration, which is stabilized by the DMI fields (pointing 
along +x) indicated by Hp,, (IP). On applying an electric current, the 
magnetization is subject to an effective field H,,;induced by the SOTs, 
which is given by: 


_ AOsyJ 
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where fh, 6.,,,J, e, Ms, t,m and o are the reduced Planck constant, spin 
Hallangle, electric current density, electron charge, saturation mag- 
netization, thickness of the magnetic layer, direction of magnetization 
and direction of the spin polarization at the Pt/Co interface, respec- 
tively. Owing to the chiral coupling, the magnetization in the middle 
of the ©|@ DW points along —x (in blue). As shown in Supplementary 
Fig. 1b, H,,;(DW) points to +z,so the ©|@ DW will propagate along the 
current direction. As soon as the ©|®@ DW approaches the IP region, 
the magnetization in the IP region experiences a dipolar field Hy, (IP) 
generated by the IP magnetization of the ©|@ DW that points along 
-x. The SOTs also compress the incident DW against the IP region, 
increasing the DW energy. This results in a compact, high-energy 
spin texture containing two closely spaced regions with tail-to-tail 
IP magnetization, as shown with the associated magnetic charges in 
Supplementary Fig. 1b: one IP region is in the middle of the ©|@ DW 
with magnetization pointing along —x (in blue), and the other IP region 
is in the inverter and has magnetization pointing along +x (inred). At 
a certain point in time, the dipolar field becomes strong enough to 
switch the magnetization in the IP region from +x to —x with the help 
of SOTs (Supplementary Fig. 1b, c). Simultaneously, the high-energy 
spintexture on the left side of the IP region collapses, bringing about 
the annihilation of the @ domain on the left side of the IP region, shown 
in grey in Supplementary Fig. 1b. After the reversal of the magnetiza- 
tion in the IP region, a reversed @ domain is nucleated on the right 
side of the IP region (shown in grey in Supplementary Fig. 1c), which 
is aconsequence of Hpy,(OOP) pointing along -z. The magnetization 
points along +x in the middle of the resulting @|© DW and H,o;(DW) 
points along -zso that this new DWis then transported by the electric 
current towards +x (Supplementary Fig. Ic). 
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Whereas this simple model provides insight into the mechanism of 
DW inversion, the detailed magnetization dynamics is more complex. 
We have therefore performed micromagnetic simulations. Here, an O|® 
DW is driven by an electric current with current density 3 x 10” Am~ 
ina narrow wire containing a straight, 30-nm-wide IP region (Supple- 
mentary Fig. 2a). The OOP anisotropy field is set to H, =1.5 kOe inthe 
OOP region and to zero in the IP region. All three components of the 
magnetization are recorded at three different positions along the wire: 
inthe centre of the IP region and 30 nm away from the centre on each 
side (see the dots in Supplementary Fig. 2a). We show in Supplemen- 
tary Fig. 2b-d how the magnetization responds to the approaching 
DW. As the ©|@ DW approaches the left side of the IP region, the mag- 
netization on the left side of the IP region reverses from —zto +z (see 
Supplementary Fig. 2b). The magnetization in the IP region reverses 
from +xto-—x along the path shown in Supplementary Fig. 2c to reduce 
the energy associated with the accumulated magnetostatic charges 
(shown schematically in Supplementary Fig. 1b). The magnetization 
onthe right side of the IP region is then forced to switch from +zto-z 
by the chiral coupling (see Supplementary Fig. 2d). 

The DMlis critical in the realization of current-driven DWinversion, 
not only to achieve current-driven DW motion, but also because of its 
rolein the nucleation of the reversed domain. The role of DMI in current- 
driven DW motion has been studied elsewhere”. Here, we deter- 
mine the role of DMI inthe DW inversion process using micromagnetic 
simulations by varying the DMI value and the OOP anisotropy in the 
IP region. The DMI-OOP anisotropy phase diagram for current-driven 
DW inversion is shown in Supplementary Fig. 3 for a current density 
of 3 x 10” A m7 ina narrow wire containing a straight, 30-nm-wide IP 
region. For zero OOP anisotropy in the IP region, the DW can be inverted 
when |D| >1mJ m~. If the DMI is reduced, it does not provide sufficient 
chiral coupling to nucleate the reverse domain, so the incident DW 
cannot be inverted. By introducing OOP anisotropy into the IP region, 
whichis expected from a Pt/Co interface, the energy for the DW inver- 
sion is reduced and the DMI operational window increases. 

To verify the impact of the IP width on the DW inversion process, we 
performed additional micromagnetic simulations of the magnetiza- 
tion dynamics in the inverter for various widths of the IP region. The 
outcomes of the simulations fora current density of 3 x 10" A m7 and 
D=-1.5 mJ m” are given in Supplementary Table 1. In the table, a tick 
indicates that an inverted DW propagates from the IP region into the 
OOP regionas required. If the IP region is too narrow (<25 nm), the OOP 
regions on either side of the inverter are strongly coupled antiparal- 
lel?° and the SOTs induced by the current are not strong enough to 
overcome the chiral coupling. If the width of the IP region is too large 
(>35nm), the chiral coupling becomes too weak to induce antiparallel 
coupling of the OOP magnetizations on the left and right sides of the 
IP magnetization”°. The DW is then simply annihilated in the IP region, 
without any further magnetization dynamics occurring on the other 
side of the inverter. The results of the micromagnetic simulations were 
confirmed by experiment: for astraight DW inverter in an 800-nm-wide 
racetrack, the DW was successfully transferred across a50-nm-wide DW 
inverter, but not across a100-nm-wide inverter. In addition, as shownin 
Supplementary Table 1, itis possible to increase the operational window 
of the IP region by including a small OOP anisotropy in the IP region. 


Influence of the shape of the IP region of the DW inverter 

Here we describe experiments comparing the performance of straight 
and V-shaped DW inverters with a 50-nm-wide IP region, beginning 
with the measurements of the straight IP inverter. As shown in the 
STXM images in Extended Data Fig. 2a, when the @|© DW encounters 
the IP region, it annihilates to the left of the IP region, and anew DW 
with opposite polarity will be nucleated to the right of the IP region. 
We performed several inversion operations in the same inverter with 
a straight IP region and found that the reversed magnetic domains 
nucleate at different locations (Extended Data Fig. 2b). This implies 


that the nucleation of the reversed magnetic domain is assisted by 
random thermal fluctuations or local inhomogeneities. 

To improve the reliability of the DW inverter, we implemented a 
V-shaped IP region, which has two main advantages: first, the tip of 
the V shape offers a convenient nucleation site for the reversed mag- 
netic domain. This is because, at the tip of the V shape, the output OOP 
region is surrounded by the input OOP region and experiences the 
strongest antiparallel chiral coupling. In the STXM measurements, 
we found that the nucleation of the reversed magnetic domain was 
located at the tip of the V shape for five out of five operations. Second, 
the Vshape of the IPregion leads to lower magnetostatic energy, thus 
lowering the energy barrier for DW inversion. As shown in Extended 
Data Fig. 2c, we compared the effective DW velocity measured in two 
DW inverters, one witha V-shaped and one witha straight IP region (see 
the method outlined in the Methods section ‘Estimation of the speed 
of alogic operation’). The velocity of the DW transferring across the 
V-shaped IP regionis higher than that in the straight IP region, and the 
standard deviation of the velocity is smaller in the inverter with the 
V-shaped IP region than inthe inverter with the straight IP region. This 
demonstrates the higher efficiency and reliability of the V-shaped IP 
region as a DW inverter. 


Estimation of the speed ofa logic operation 

Here we describe the estimation of the speed of alogic operationin the 
NOT gates. First, we measure the DW velocity, Upy, inthe uniform OOP 
region of the racetracks. Then we determine the DW displacement, 
Low from§, to S, across the NOT gate, following N current pulses (see 
schematic in Extended Data Fig. 7). From this, we can obtain the time 
taken by the DW totransfer across the NOT gate, tyo;, and therefore 
the effective velocity of the DW, v,7, as it transfers across the NOT 
gate and is inverted: 


Low ~ Lnot 
tnot= Nt pulse - Divs 
(3) 
v as Lnot 
NOT t 
NOT 


where fpuise is the duration of one current pulse and Lyor is the length 
of the NOT gate. With this method, we can determine v,y and vyor aS a 
function of current density (Extended Data Fig. 7). The pulse length was 
decreasedto 2ns for high current densities to reduce heating (inset in 
Extended Data Fig. 7). We find that the velocity of the DW in the NOT 
gate can reach 160 +17 ms" foracurrent density of 1.65 x 10” A m7. 
This value of the DW velocity is used to estimate the energy consump- 
tion (see main text). 

Here we estimate the time required for the DW totransfer across the 
DW inverter with the dimensions indicated in Extended Data Fig. 6, 
scaled down to 10 x 10 nm”. Taking the effective inverter DW veloc- 
ity determined from the experiment, 160 ms‘, the time for a DW to 
transfer across the downscaled inverter is ~60 ps. Fora more accurate 
estimation, we perform micromagnetic simulations for a downscaled 
inverter with dimensions of 10 x 10 nm’. At suchasmall scale, the device 
designis limited by the feature size that can be nanofabricated. There- 
fore, a straight IP region is considered instead of a V-shaped one, with 
a width of 10 nm. Taking the simulated effective inverter DW velocity 
of 118 ms“, the time for a DW to transfer across the inverter is 85 ps, 
whichis similar to the rough estimation above. The speed of operation 
can be further improved by optimization of the material to increase the 
DW velocity—for example, by using an amorphous magnetic material 
suchas CoFeB instead of Co—and optimization of the device design. 

Using a similar method to that used for the DW inverter, we deter- 
mined the speed ofa logic operation in the NAND gate experimentally. 
For this, the operation of a NAND gate with two DW inverters in the 
input reservoirs was captured using MOKE imaging. Following the 
application of current pulses, DWs propagate throughthe NAND gate 


and perform logic operations (see MOKE images and corresponding 
schematics in Extended Data Fig. 8). From these images, we determine 
the time required for the DW to transfer across the NAND gate to be 
tyanp = 74.1 ns with an effective DW velocity of Uyayp =10.8 m sat acur- 
rent density of 7.5 x 10"A m7. 


Synchronization and propagation delay times in the DW circuits 
Owing to the presence of defects that lead to pinning of the DWs and 
theintrinsic stochastic nature of current-driven DW motion, the arrival 
time of DWs at the logic gates can be different. In electronic logic cir- 
cuits, this is commonly addressed by introducing a propagation delay 
time for each operation, that is, the circuits are cycled at a rate that is 
slower than the longest internal propagation delay times. The same 
concept of propagation delay time can be applied to our magnetic DW 
logic gates to obtain a stable output that is independent of the arrival 
time of the input domains. In other words, with sufficient propagation 
delay time, all the DWs will arrive at the logic gate, which will result in 
the correct output for a given logic operation. 

To demonstrate how the introduction of a propagation delay time 
can improve the operational reliability of the gate, we consider the 
simplest case of a NAND gate with the logic inputs changing from ‘00’ 
to ‘1l’ over time. As schematically shown in Extended Data Fig. 9a, the 
arrival times of @|© DWs for logic inputs a and bare different. In this 
case, the change from ‘0’ to ‘I’ of input ais at a later time than that for 
input D, thatis, t, > ¢,, where ¢; (i=, b) is the time at which the inputs 
change. Once both DWs arrive at the gate, it will take some time for the 
nucleation of the reversed magnetic domain to occur that depends on 
the effective DW velocity in the NAND gate. The time at which the cor- 
rect magnetic domain propagates into the output racetrack is defined 
as the required propagation delay time, f,.1,,. We fabricated NAND gates 
with ‘11’ logic inputs with different input racetrack lengths to obtain 
different arrival times for the two logic inputs. On application of cur- 
rent pulses, a @|© DW propagates in both the left and the right input 
racetracks, thus providing a means to test the reliability of the propa- 
gation delay time of the logic gate. As shown in Extended Data Fig. 9b, 
all devices give the correct output ‘0’, demonstrating that the output 
of the magnetic DW logic gate is independent of the difference in the 
arrival time of the input DWs. 

In general, for all logic operations in the NAND gate, the operation 
includes: (i) DW propagation in the input racetracks, (ii) transfer of DWs 
across the logic gate and (iii) DW propagation in the output racetrack. 
The total operation time, and therefore the required propagation delay 
time, can then be expressed as: 


— Linput t Loutput : LNAND (4) 


Caelay 


Upw UNAND 


where L input Lour ANd Lyanp are the lengths of theinput racetrack, output 
racetrack and NAND gate, respectively. Upy and Uyanp are the DW veloci- 
ties inthe magnetic racetrack andinthe NAND gate, correspondingly. 
Assuming that the DW velocities in the magnetic racetrack and NAND 
gate have anormal distribution: 


© 2 
Upw * N(Opw, Opw) 


(S) 


ee 2 
Unanp = N(Onanp» ONaND) 


with the average velocity of the DW motion in the racetracks (NAND 
gate) pw (Oyanp) and the standard deviation py (Gyanp). To obtain a 
97.5% probability that the logic operation is successful 
(see Extended Data Fig. 9c), the required propagation delay time is 
estimated to be: 


Linput + L output + 


L 
NAND (6) 


Caelay,97.5% = 


Dpw-20pw — Dnann~ 29nanpd 


To demonstrate that a sufficient propagation delay time can improve 
the reliability for a statistically significant number of operations in 
a NAND gate, we placed the output of a NAND gate on a Hall cross 
(Extended Data Fig. 10a) and performed 1,172 measurements. For each 
measurement, the NAND gate was saturated with an OOP magnetic field 
to set the initial magnetization direction to © in all of the reservoirs, 
and aseries of current pulses was applied. The output was measured 
viathe anomalous Hall effect on application ofad.c. current. The pulse 
source and d.c. source were separated by a bias tee. As indicated by 
the change of the Hall resistance in Extended Data Fig. 10b, the output 
changed from ‘0’ to‘l’ and back to ‘0’. The electrical measurements were 
verified by MOKE measurements performed on the NAND gate (see 
MOKE images in Extended Data Fig. 10b). By using 14 current pulses 
(equivalent toa propagation delay time of 14 x 30 ns), the reliability of 
the NAND gate increased to >95% (Extended Data Fig. 10c). 

Foracascaded logic circuit, the propagation delay time is determined 
by the longest DW propagation route in the circuit. To decrease the 
propagation delay time, several possible approaches can be employed; 
for example, scaling down the dimensions of the circuit, increasing the 
DW velocity and decreasing the DW pinning by optimizing the materials 
and the nanofabrication to reduce imperfections. 


Reliability of the logic gates 

Torealize large-scale implementation of the logic gates, reliable opera- 
tionis essential. Here, we evaluate the reliability of the two basic NOT 
and NAND gates interms of device-to-device reliability and operational 
reliability (Supplementary Table 2). 

To demonstrate the high device-to-device reliability of the NOT gate, 
we fabricated 35 NOT gates, and 34 of them (97%) showed successful 
operation. Wealso performed the NOT operations with various current 
densities inthe range 4 x10" Am7to1.65 x10” A mina single device 
with 100% operational reliability. 

To test the device-to-device reliability of the NAND gate, we fabri- 
cated 56 NAND gates with different logic inputs, and the average suc- 
cess rate was found to be 42/56 (75%). The failure of some of the devices 
may be related to pinning of the DWs by defects in the material or by 
irregular features resulting from the nanofabrication. In particular, the 
width of the magnetic racetracks inthe NAND gate is 200 nm, compared 
to 800 nm for the NOT gate, which means that edge roughness can 
induce additional pinning. We performed 20 operations for each of 
four selected devices (Supplementary Fig. 4), and they all gave correct 
outputs, showing high operational reliability. 

We now consider the distribution of the device-to-device reliability 
for different logic inputs. For the 56 NAND gates, we fabricated 14 of 
each type with logic inputs ‘00’, ‘11’, ‘01’ and ‘10’. The ratio of the number 
of NAND gates that give correct outputs to the total number of NAND 
gates are 13/14, 11/14, 10/14 and 8/14 for logic inputs of ‘OO’, ‘11’, ‘O1’ 
and ‘10’, respectively. The device-to-device reliabilities for ‘OO’ and 
‘IV inputs are slightly higher than those of the ‘O1’ and ‘10’ inputs. This 
can be understood by considering the energy difference between the 
‘Y and ‘0’ outputs for the ‘00’, ‘11’, ‘Ol’ and ‘10’ inputs, which is given by: 


AE YO =~ (2Einput + Epias) 
AE Vo = 2Einput — Evins 

AE Yo =~ Epjias 

AE Vo =~ pias 


(7) 


where AE}, is the energy difference between the ‘T and ‘0’ outputs for 
input g (i = 11, 00, 01 or 10), Finpu: is the coupling strength between 
output and input, and £,,,, is the coupling strength between the output 
and the bias. From this set of equations, it follows that thestable output 
for the ‘11’ input is ‘O’ (AE,,. > 0) and the stable outputs for inputs ‘OO’, 
‘Ol’ and ‘10’ are all ‘1’ (AE,,. < 0), which satisfies the truth table of the 
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NAND operation. For the NAND gate used in the experiment, the bias 
is slightly smaller than that of the inputs. Because the energy of the 
coupling between two OOP magnetizations separated by the IP region 
is proportional to the length of their boundary, the coupling energy 
between the output andinput magnetization is larger thanthat between 
the output and bias magnetization, that is, Finpu: > Evias: Hence, |AE°°| > 
|AE™| > |AE®| = |AE"?| for the NAND operation. This trend in the energy 
difference between the correct and erroneous outputs for different 
logicinputs correlates well with the trend in the device-to-device reli- 
ability for different logic inputs. 

We also tested the operational reliability of the cascaded logic circuit 
(full adder) shown in Fig. 4d. The ratio of the number of successful 
operations to the total number of operations performed was found 
to be 28/30. 

Therefore, in our proof-of-concept experiments, we have demon- 
strated the high reliability of the magnetic gates. We further emphasize 
that there is still room to improve the device-to-device reliability in 
terms of optimization of the fabrication process, device design and 
material properties. 


Electrical control of logic inputs and detection of logic output 
Forthe proof-of-concept experiment shown in Fig. 3f, we ensured that 
specific DWs reached the inputs by placing inverters on the input race- 
tracks. After saturation with an OOP magnetic field, the magnetization 
direction in the racetrack was set to ©. On application of a current, 
the magnetization of the propagating DWs was reversed as they were 
transferred across each inverter. By placing different numbers of 
inverters at different positions in the input racetracks, we generated 
a sequence of logic inputs in order to obtain different inputs at the 
same gate over time, thus demonstrating the real-time operation of 
the gate. We also showed that electrical switching of DW propagation 
ina Y-shaped structure could be used to inject DWs and define specific 
logic inputs (Fig. 4a). 

For downscaled logic circuits, magnetic tunnelling junctions (MTJs) 
fabricated on the logic-input racetracks would provide amore compact 
method to control the logic inputs (Supplementary Fig. 5). Indeed, it 
has been shown that an MTJ ona magnetic racetrack can be used to 
write magnetic domains via spin transfer torque”. Therefore, MTJs 
fabricated on magnetic racetracks can be used to electrically control 
the logic inputs. For the detection of the logic outputs, we used MFM, 
MOKE microscopy and Hall measurements in our proof-of-concept 
experiment. For the downscaled logic circuits, MTJs fabricated onthe 
output racetracks could be used to electrically detect the logic outputs. 

Moreover, itis practical to not only read the output of a gate but to 
also transfer it to the input of another gate using MT] devices, in order 
to obtain information feedback. The feedback is critical for sequential 
logic operations such as those performed ina flip-flop gate. The MTJ/ 
racetrack hybrid structure can providea compact methodto perform 
complex logic. 


Energy consumption of downscaled logic devices 
To estimate the energy consumption of the inverter used in the experi- 
ments, we consider the area containing the V-shaped IP region, where 
the DW is reversed. The energy consumption per operation of the 
inverter is calculated from the power-delay product in the bottom 
Pt layer: 
2 2 
E=lRt= od (8) 
UNoT 


where/, p, W,L, hand vyo; represent the current density, resistivity of Pt, 
inverter width, inverter length, thickness of Pt layer (5 nm) and effective 
DW velocity inthe inverter, respectively. Taking the inverter dimensions 


(Wx L=0.8 x 1.0 ppm’), the resistivity of a thin Pt film (9 =30.0 pO. cm), 
the experimentally measured current density (/=1.65 x 10” Am) and 
the effective inverter DW velocity (vyo; = 160 ms”), the energy con- 
sumption per operation of the inverter is calculated from equation 
(8) to be 20.4 pJ. 

For arough estimation of the energy consumption of adownscaled 
inverter, we scale down the dimensions of the inverter while keeping 
the value of the Pt layer thickness, the Pt resistivity, the current density 
and the effective DW velocity across the inverter the same as those 
measured in the experiment. The energy consumption per operation 
for an inverter with the dimensions indicated in Extended Data Fig. 2 
scaled down to 10 x 10 nm’ is 25.5 aJ. For a more accurate estimation, 
we performed micromagnetic simulations for a downscaled inverter 
with lateral dimensions of 10 x10 nm’. At sucha small scale, the device 
designis limited by the feature size that can be nanofabricated. There- 
fore, a straight IP region is considered instead of a V-shaped one, with 
a width of 10 nm. Using a simulated current density of 1.2 x10” Am~ 
and effective inverter DW velocity of 118 ms* in equation (8), we find 
that the energy consumption per operation is 18.4 aJ, which is similar 
to the rough estimate above. This energy consumption for the down- 
scaled inverter is comparable to the switching energy of ~30 aJ found 
in advanced complementary metal-oxide-semiconductor devices”. 
Further improvement of the energy consumption can be achieved by 
optimizing the material and device design in order to decrease the 
required current density and increase the DW velocity. 

The above estimation concerns only the energy consumed in the 
logic gate, that is, the energy consumed for the inversion of a DW. Addi- 
tional energy is required to nucleate the DWs in the racetrack for the 
logic inputs, to detect the logic outputs and to move DWs along the 
interconnections. The total energy consumption therefore depends 
on the detailed design of the logic circuit. 
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Extended Data Fig. 1| Device fabrication and magnetic characterization. 
a, Schematics of main nanofabrication processes for magnetic DW logic 
circuits. (i) lon milling of magnetic Pt/Co/Al multilayer to create magnetic 
strips, (ii) ion milling to produce magnetic racetracks and logic gates, and 


(iii) oxidization of the Al layer inthe OOP regions. The inset shows an SEMimage 
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of the 50-nm-wide PMMA mask used to protect the IP region of the NAND gate 
shown in Fig. 3a during oxygen plasma treatment. The scale bar is100 nm. 

b, Polar MOKE measurement of the IP and OOP regions on application ofan 
OOP magnetic field. c, Anomalous Hall measurement of the OOP regionon 
application of anIP magnetic field. 
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Extended Data Fig. 2 | DW inversionina DW inverter with astraight IP 
region.a, STXM image sequence of DW inversion for an incident @|O DW 
performedina DW inverter witha straight IP region. Each XMCD image is 
captured after the application of two current pulses. The edges of the magnetic 
racetracks are indicated with red dashed lines and the positions of the inverters 
areindicated with solid white lines. The bright and dark regions inthe XMCD 
images correspond to© and @ magnetization, respectively. Thecurrent 
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density and duration of the pulses are 1.1x 107 Amand 1ns, correspondingly. 
b, STXMimages of the nucleation of reversed magnetic domains in the same 
DWinverter for four different operations.c, DW velocity ininverters with 
V-shaped and straight IP regions as a function of current density, determined 
from the experimental MOKE measurements. Error bars and shading represent 
the standard deviation of the DW velocity for 5measurements. All the scale bars 
are500nm. 
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Extended Data Fig. 3 | Further experimental demonstration ofthe DW 
inverter.a, b, MOKE image sequences of DW inversion for incident ©|@ (a) and 
@I© (b) DWs performed inthe same device used for Fig. 2c. The edges of the 
magneticracetracks are indicated by red dashed lines and the positions of the 
inverters are indicated by white lines. The bright and dark regions in the 
magnetic racetracks in the MOKE images correspond to © and@ 
magnetization, respectively. The current density and pulse length of the 
applied current pulses are 7.5 x 10" A m7 and 50 ns, correspondingly. The 
number of applied current pulses is indicated. The entire image sequences are 
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shown in Supplementary Videos 1, 4.c, STXM image sequence of DW inversion 
for anincident @|© DW and corresponding micromagnetic simulation. Each 
XMCD image is captured after the application of one current pulse witha 
current density of 1.110 Amanda pulse length of1ns. The bright and dark 
regionsin the XMCD images correspondto© and @ magnetization, 
respectively. For the simulated images, the IP directions of the magnetizations 
are given by the colour wheel, and the white and black regions correspond to 
© and @ magnetization, respectively. The scale bars inthe MOKE images are 

3 um and those inthe XMCD images with simulations are 500 nm. 
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Extended Data Fig. 4| Electrical control of DW motion through across 
structure. The directions of the current/andthe DW motion are the sameand 
areindicated for each image (current density 9 x10" Am”, pulse length30ns). 
The edges of the cross structures are indicated with red dashed lines. The 
bright and dark regions in the cross structure inthe MOKE images correspond 
to©and @ magnetization, respectively. All the scale bars are 1pm. 
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Extended Data Fig. 5 | Various cascaded DW logic circuits. a, AND gate one NOT gate. d, Half-subtractor constructed by cascading four NAND gates 
fabricated by cascading one NAND gate and one NOT gate. b, Cascaded DW and one NOT gate. e, Extensive cascaded DW logic circuits including 10 NAND 
logic circuit with a NAND gate and a NOR gate. Greenand purpleinthe gates and 11NOT gates. The bright and dark areas in the device regions inthe 
schematic correspond to © and @ magnetization, respectively. Thereis an MFM images correspond to © and @ magnetization, respectively. The MFM 
inverter placed in the bias reservoir of the NOR gate highlighted with the green images are captured after saturation with an OOP magnetic field to set the 
box, giving a bias of ‘1’, as shown in inset, whereas the bias for the NAND gate initial magnetization direction to© inall ofthe reservoirs, followed by current 


is‘O’.c, Two-bit multiplexer constructed by cascading three NAND gates and pulses to obtain the final states. All the scale bars are 500 nm. 
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Extended Data Fig. 6 | Magnetic DW logic elements. Red and blue shaded 
regions indicate regions that have OOP and IP anisotropy, respectively. The 
direction of the current flow is indicated by black arrows. The dimensions of 
the magnetic DW logic elements used in the experiments are indicated. 
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Extended Data Fig. 7 |DW velocity ina uniform OOP region of aracetrack 
and effective DW velocity ina NOT gate as afunction of current density. 
Error bars represent the standard deviation of the DW velocity measuredin5 
different devices. 
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Extended Data Fig. 8 | Time sequence of MOKE images of the NAND gate 
during operation and corresponding schematics. The NAND gate contains 
two inverters ineach ofthe DW reservoirs anda bias set to ‘0’. The boundaries 
of the NAND gate are indicated by red dashed lines. The two V-shaped inverters 
inthe DW reservoirs are associated with small reversed domains in the initial 
state (purple triangles) resulting from the chiral coupling. Asequence of MOKE 
images is captured and each image is taken following two current pulses witha 
current density of 7.5 x10" A manda pulse length of 30 ns. The bright and 
dark regions inthe gate structure in the MOKEimages correspondto© and @ 
magnetization, respectively. Inthe schematics, green and purple correspond 
to© and @ magnetization, respectively. The two DW reservoirs and the bias are 
set to logical value ‘0’ by applying an OOP magnetic field of 1kOe. All the scale 
bars arelpm. 
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Extended Data Fig. 9 | Propagation delay time for DW logic. a, Schematic 
showing the use of the propagation delay time to improve the operational 
reliability of logic operationina NAND gate. Green and purple correspond to 
© and @ magnetization, respectively. b, MFM images of NAND gates with 
different input racetrack lengths (top, /,=/,; middle, /,>/,; bottom, l,</,;where 
1,and 1, represent the racetrack lengths ofinput a and b, respectively). The 
bright and dark regions correspond to © and @ magnetization, respectively. 
The MFM images arecaptured after saturation with an OOP magnetic field to 
set the initial magnetization direction to © inall ofthe reservoirs, followed by 
application of current pulses to obtain the final states. The direction of the 
current flowis indicated. c, Schematic showing the dependence of the 
probability of giving a correct output as a function of propagation delay time. 
Allthe scale bars in the MFM images are 500 nm. 
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Extended Data Fig. 10 | Hall measurement of logic operation ina NAND gate. 
a, Schematic and optical microscope image of the device. Red, yellow and 
purple colours inthe image represent the regions with electrodes, the Pt cross 
and the NAND gate, respectively. b, Hall resistance asa function of pulse 
number and corresponding MOKE images. Left, typical evolution of the Hall 
resistance with increasing number of pulses. Right, the first 30 and last 30 of 
the 1,172 repeated measurements. The Hall resistance levels for © and @ output 
magnetizations are indicated by the red dashed lines. An OOP magnetic field is 
applied to set the initial state at the beginning of each measurement, indicated 
withthe red arrow. The bright and dark regions in the gate structure in the 
MOKE images correspond to © and @ magnetization, respectively. The 
boundaries of the NAND gate are indicated by red dashedlines. The current 
density and pulse length of the applied current pulses are 7.5 x10" Amand 

30 ns, respectively. c, Operational reliability as a function of the number of 
current pulses. All the scale bars are2 um. 
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Ultrathin two-dimensional (2D) semiconducting layered materials offer great 
potential for extending Moore’s law of the number of transistors in an integrated 
circuit’. One key challenge with 2D semiconductors is to avoid the formation of charge 
scattering and trap sites from adjacent dielectrics. An insulating van der Waals layer of 
hexagonal boron nitride (hBN) provides an excellent interface dielectric, efficiently 
reducing charge scattering”*. Recent studies have shown the growth of single-crystal 
hBN films on molten gold surfaces‘ or bulk copper foils®. However, the use of molten 
gold is not favoured by industry, owing to its high cost, cross-contamination and 
potential issues of process control and scalability. Copper foils might be suitable for 
roll-to-roll processes, but are unlikely to be compatible with advanced 
microelectronic fabrication on wafers. Thus, a reliable way of growing single-crystal 
hBN films directly on wafers would contribute to the broad adoption of 2D layered 
materials in industry. Previous attempts to grow hBN monolayers on Cu (111) metals 
have failed to achieve mono-orientation, resulting in unwanted grain boundaries 
when the layers merge into films®”. Growing single-crystal hBN on such high- 
symmetry surface planes as Cu (111)°* is widely believed to be impossible, evenin 
theory. Nonetheless, here we report the successful epitaxial growth of single-crystal 
hBN monolayers ona Cu (111) thin film across a two-inch c-plane sapphire wafer. This 
surprising result is corroborated by our first-principles calculations, suggesting that 
the epitaxial growth is enhanced by lateral docking of hBN to Cu (111) steps, ensuring 
the mono-orientation of hBN monolayers. The obtained single-crystal hBN, 
incorporated as aninterface layer between molybdenum disulfide and hafnium 
dioxide in a bottom-gate configuration, enhanced the electrical performance of 
transistors. This reliable approach to producing wafer-scale single-crystal hBN paves 
the way to future 2D electronics. 


First, asingle-crystal Cu (111) thin film on a wafer is needed. Single-crystal 
Cuin thick foils can be achieved through recrystallization induced by 
implanted seeds>’. However, for the formation of Cu (111) thin film ona 
wafer, the crystallinity relies strongly on the underlying substrate lattices. 
Here we used ac-plane sapphire as the substrate, on which we sputtered 
a500-nm-thick polycrystalline Cu film and then carried out extensive 
thermal annealing to achieve single-crystal Cu (111) films’. One chal- 
lenge is that Cu (111) tends to form twin grains separated by twin-grain 
boundaries, through kinetic growth processes. Figure 1a illustrates the 
atomic arrangements for the typical twinned Cu (111) structure. We find 
that post-annealing at a high temperature (1,040-1,070 °C) in the pres- 
ence of hydrogen is the key to removing the twin grains, consistent with 


recent reports", Figure 1b, c shows optical micrographs and electron 
backscatter diffraction (EBSD) patterns for the Cu (111) thin films after 
annealing at 1,000 °C and 1,050 °C. The EBSD results (see also Extended 
Data Fig. 1a, b) confirm the coexistence of twinned Cu (111) polycrystals 
with 0° and 60° in-plane misorientation for the Cu thin films annealed 
at 1,000 °C. Thein-plane misorientation was removed after annealing at 
1,050 °C, producing single-crystal Cu (111). The X-ray diffraction results 
(Extended Data Fig. 1c—f) also consistently illustrate our success in obtain- 
ing single-crystal Cu (111) thin films. Note that the Cu (111) is preferentially 
formed witha thinner Cu film, butasufficiently thick Cufilmis necessary 
to prevent Cu evaporation during subsequent hBN growth. Thus, there is 
an optimal Cuthickness (around 500 nm) for single-crystal hBN growth. 
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Fig. 1| Cu (111) lattice orientation on c-sapphire substrates. a, Diagram 
showing twinned Cu (111) (top view) with an in-plane rotation of 60°; the blue 
and red triangles enclose areas with the same Cu stacking configurationin 
adjacent twinned grains. The right-hand coloured triangle is the legend for the 
inverse pole figure (IPF) maps at the bottom of panelsb, c. b, Optical 
microscope image (top) and EBSD IPF mappings (1 x 1mm7?; bottom) ofa Cu thin 


Achieving the growth of mono-oriented hBN triangular flakes is an 
essential step towards obtaining wafer-scale single-crystal hBN. Owing 
to the sixfold symmetry of Cu (111), the van der Waals registry of hBN 
to Cu (111) leads to two sets of energy-minimal configurations (whose 
orientation differs by 60° or 180°, an inversion) with almost degener- 
ate binding energies. Thus it is commonly believed that confining hBN 
flakes to mono-orientation onsuchahigh-symmetry surface is impos- 
sible**. Our experiments reveal that the energy degeneracy can be lifted 
inthe presence of spontaneously present top-layer Cu step edges. The 
growth ofanhBN monolayeris carried out by flowing ammonia borane 
precursors ontothe one-inch single-crystal Cu (111) thin film/sapphire 
in a hot-wall chemical vapour deposition (CVD) furnace. An optical 
micrograph of monolayer hBN triangular flakes grown ona Cu (111) 
thin film with twin grains (Fig. 2a) shows that hBN flakes orient to the 
same direction on one twin and to the opposite direction (or 60° in- 
plane rotation along the zaxis) on the counterpart twin (Extended Data 
Fig. 2). Figure 2b displays optical micrographs of hBN flakes grown on 
a single-crystal Cu (111) thin film without twin grains, where almost all 
of the triangles are unidirectionally aligned (see also Extended Data 
Fig. 3 for statistical analysis of the orientation distribution of hBN tri- 
angular flakes). The observation of mono-orientation on an individual 
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film annealed at 1,000 °C for 1h, where the normal direction (ND) and rolling 
direction (RD) indicate the presence of twinned Cu (111) polycrystals with in- 
plane misorientation by 60°.c, If the Cu thin filmis annealed at 1,050 °C for1h, 
the in-plane orientation is unified, showing the formation of single-crystal 

Cu (111) without any twinned grains. 


single-crystal Cu (111) grain clearly indicates the existence of an energy- 
minimized hBN-Cu (111) configuration. Therefore, eliminating the 
twin grains in Cu (111) will ensure the growth of single-crystal hBN onit. 

To verify the single crystallinity, we characterized the hBN mon- 
olayer merged from mono-oriented triangles by microspot low-energy 
electron diffraction (u-LEED), using a probe size of around 31m at 80 
sites across the one-inch wafer. Figure 2c displays the p-LEED patterns 
from nine randomly selected sites. All results revealan hBN monolayer 
that is unidirectionally aligned with the Cu (111) surfaces, indicating 
that their single crystallinity strictly follows the Cu (111) lattices. The 
atomically resolved scanning tunnelling microscopy (STM) image of 
hBN on Cu (111) in Fig. 2d shows a perfect hBN lattice with a measured 
lattice constant of 2.50 + 0.1A, consistent with the theoretical value 
of 2.5 A. We probed more than 20 locations, and all STM images show 
the same hBN lattice orientation (Extended Data Fig. 4). We did not 
observe any grain boundaries formed by adjacent misoriented hBN 
domains, suggesting the single-crystalline nature of hBN. We note that, 
in some areas, moiré patterns arise from the lattice mismatch and/or 
relatively small rotation (within 1.5°) between hBN and the underlying 
Cu (111) substrate (Extended Data Fig. S5a—f). The magnified atomic- 
resolution image at the moiré boundary areas reveals that the hBN 
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Fig. 2| Growth and atomic structures of single-crystal hBN on Cu (111) films. 
a, Optical microscope image of hBN grown on different Cu (111) grains. Black 
dashed lines indicate twin grain boundaries. Oppositely oriented flakes are 
marked by redand blue dashed triangles. b, Mono-oriented hBN flakes on 
single-crystal Cu (111) films.c, p-LEED patterns of hBN monolayers at nine 
different areas randomly selected from the 1.5 x 1.5 cm? sample surface. All 
p-LEED patterns show that the hBN monolayers havethe same orientationas 
the Cu (111) surfaces. d, Left to right, large-scale STM image (V,,, = -1.008 V; 

hip = 3-90 NA; T=300 K) and atomic-scale STM images (V,;, =—0.003 V; 


tip tip 
hip = 54.508 nA; Vip = - 0.003 V, fin = 46.50 NA; T=300 K) of hBN/Cu (111) with 


tip ip 


presents perfect lattice coherence at the patching boundary (Extended 
Data Fig. 5g-j), indicating that the formation of moiré patterns does not 
affect the overall hBN orientation. We believe that the hBN completes 
the single-crystal growth at high temperatures, and thatthe strain asso- 
ciated with sample cooling after growth results in the formation of local 
moiré pattern. Other characterizations, including X-ray photoelectron 
spectroscopy (XPS) and Raman spectroscopy, prove the B-N chemi- 
cal bonding structures (Fig. 2e, f). Transmission electron microscope 
(TEM) and atomic force microscope (AFM) images consistently show 
that the as-grown hBN is indeed a monolayer (Fig. 2g, h). 

We recognize that, once the Cu (111) thin films are prepared and 
formed untwinned at 1,050 °C as described above, the mono-orientated 
growth of hBN flakes can be realized at various growth temperatures 
ranging from 995 °C to 1,070 °C (Extended Data Fig. 6). However, a 
lower growth temperature (995 °C to 1,010 °C) usually leads to lower- 
quality hBN flakes, which are easily oxidized in a subsequent oxidation 
test at 150 °C in air. Therefore, we used a higher growth temperature 
(typically 1,050 °C) to ensure high-quality single-crystal hBN growth. 

To explain the preferred orientation of hBN on Cu (111), we consider 
a small and rigid B,N, molecule (that is, an energetically favourable 
N-terminated three-ring structure”) as the probe seed. Wefirst examine 
the effect of plane-to-plane epitaxy, using density functional theory 
(DFT) to calculate the binding energies of six typical atomic stacking 
configurations (Fig. 3a), where N,By,, N,,B,, and N,,B, are defined as the 0° 
orientation, and N,B,, N;,By, and N,,B, are the 60° (inverted) orientation. 
The notation N,B, represents the stacking of N atoms in registry with 
(above) the Cu atoms in the ith layer, while B atoms register with the 
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measured lattice constant 2.50 + 0.1A. We measured the height of each step 
from the left-hand image to be roughly 2 A.e, XPS spectra measured from an 
as-grown monolayer hBN film on Cu (111)/sapphire. The binding energies of the 
Bl1sandN Isorbitals at 190.4 eV and 398.0 eV confirm the formation of hBN. 

f, Raman spectrum ofa transferred hBN film, where the £,, mode at1,371cm™ 
witha full-width at half-maximum (FWHM) of 14.5cm ‘confirms that the hBNis 
a monolayer. g, Cross-sectional TEM image of a monolayer hBN transferred 
onto SiO,/Si, where the thickness of hBN is around 0.5 nm.h, AFM image of a 
single-crystal hBN film transferred ontoa SiO,/Si substrate. 


Cuatoms in the jth layer. The calculations show that the stacking with 
N atoms ontop of the first-layer Cu atoms (N,B,, (0°) and N,B, (60°)) 
exhibits the lowest energy, while B atoms on top of the first-layer Cu 
atoms (N,,B, (0°) and N,,B, (60°)) are energetically unfavourable. The 
preferential registrations reflect the electron affinity of the Band N 
atoms, which leads to attractive (or repulsive) Coulomb interactions 
between N (orB) atoms andthe first-layer Cu atoms, and hence affects 
the structural stability. We find that the lowest-energy structures for 
0° (N,B,,) and 60° (N,B,) orientations exhibit an energy difference of 
only 0.05 eV or so, much smaller than the thermal energy k,7 at the 
growth temperature (roughly 0.1eV), indicating that the plane-to-plane 
registry is insufficient to achieve mono-oriented growth, inagreement 
with simulations”. 

In fact the Cu (111) surface is not perfectly flat and many terraced 
meandering steps exist, as revealed in STM images (Fig. 2d and Extended 
Data Fig. 7a—d). Recent theory showed that one must consider the role 
of these step edges in guiding hBN growth”. Other work’ suggests that 
docking at the vicinal step edges on the Cu (110) surface governs the 
single-crystal hBN growth, based onthe assumption that the Cu terrace 
steps trend only up or downall the way across the whole vicinal surface 
of the Cu foil. However, our STM results clearly reveal that the terrace 
steps of the Cu (111) surface trend both up and down across the wafer, 
and that the edge-docking can seemingly yield hBN in both directions, 
unless the binding energies differ enough to favour one direction over 
another. Tocapture this in our model, we add anextra layer of Cuatoms 
(red in Fig. 3) ontop of the first layer, forming two opposite step edges 
(A- and B-step edges in Fig. 3a and Extended Data Fig. 7f). This restricts 
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Fig.3|DFT calculations of epitaxy at nucleation, with and without energies for the six B,N,-Cu (111) configurations, with and without including 
considering step-edge docking. a, Lowest-energy atomic arrangements for the edge-to-step epitaxy effect. The bottom image is a three-dimensional view 
six B,N,-Cu (111) configurations, considering the edge docking to two typical of the N,B,-Cu (111) configuration. 


step-edge terminations (A and B) of top-layer Cu (111). b, Calculated binding 


the B,N, seed to the 0° (or 60°) orientation when docking tothe A(or way withthe presence of Cu step edges (Fig. 3b): the two configurations 
B) steps (Fig. 3a). In Fig. 3a, bthe distance between the Custepedge _N,B,, (60°) and N,B,, (0°), which were nearly degenerate when consider- 
and the B,N, zigzag edge for each configuration hasbeendetermined ing only plane-to-plane epitaxy, are now separated by 6£ values of 
by energy minimization (Extended Data Fig. 8). Theresulting binding approximately 0.23 eV, this value raising in proportion with the 
energy for each configuration changesinasubtle yet veryimportant docking length, rapidly amplifying the Boltzmann selectivity factor 
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Fig. 4| Schematic and photographsillustrating the wafer-scalehBNtransfer — the anode, with an applied DC voltage of 4 V. During this process, the TRT/ 


processes. a, As-grown two-inchhBN film ona Cu (111)/sapphire wafer. Top, PMMA/hBN stacked film is detached from the Cu (111)/sapphire through the 
schematic; bottom, photograph. b, Electrochemical delamination, using an generation of hydrogen bubbles at the hBN/Cu interface. PMMA, poly(methyl 
aqueous solution of NaOH (1M) as the electrolyte, the Cu layer inthe TRT/ methacrylate); TRT, thermal release tape. c, Two-inch monolayer hBN film 


PMMA/hBN/Cu (111)/sapphire stack as the cathode, and a platinum (Pt) foilas transferred ontoa four-inch SiO,/Si wafer. 
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e®!/ks! (for a contact length of just five to six hexagons, the Boltzmann 
selectivity factor increases to more than 10°; Extended Data Fig. 9). 
Such an energy difference apparently ensures the mono-orientated 
growth. Our STM results (Extended Data Fig. 7) show that all meander- 
ing steps are rather curved and locally rugged, so that they all consist 
of segments of A and B types. BN seeds should kinetically nucleate at 
the AtoB corner while docking to stronger binding sites, Btypes, with 
proper orientation (Extended Data Fig. 7e).The simulations, together 
with experimental results, indicate that Cu (111) surfaces with step 
edges are the key to achieving single-crystal hBN growth. 

Following our success in growing single-crystal hBN ona one-inch 
Cu (111) thin film, we further scaled the growth to a two-inch wafer, as 
depicted in Fig. 4a. Given that the interaction between a fully grown hBN 
layer and Cu (111) is limited to weak van der Waals forces, the detach- 
ment of wafer-scale hBN can be achieved by polymer-assisted transfer 
with the help of electrochemical processes»”* (Fig. 4b and Extended 
Data Fig. 10). Figure 4c shows a photograph of a two-inch hBN mon- 
olayer transferred onto a four-inch SiO,/Si wafer. We show that the 
growth of single-crystal wafer-scale hBN on Cu (111) thin films is scal- 
able and much more cost-effective than using thick Cu foils or other 
metals, andthus could bea preferred approach for the microelectron- 
ics industry. The availability of wafer-scale single-crystal hBN should 
stimulate and enable further research and development of futuristic 
2D electronics. We constructed monolayer MoS, field-effect transis- 
tors (FETs) with and without single-crystal and polycrystalline hBN as 
aninterface dielectric ina bottom-gate configuration (Extended Data 
Fig. 11). The enhancement of mobility in MoS, and the suppression of 
hysteresis are substantial inthe device with a single-crystal hBN mon- 
olayer, suggesting its promise for 2D-based transistors. 
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Methods 


Growth of Cu (111) thin films 

The as-received sapphire substrates were first etched ina mixed H,SO,/ 
H,PO, aqueous solution at 300 °C for 20 min. The etched sapphire 
substrates were cleaned by immersion in ultra-pure water for 5 min. 
After cleaning, the sapphire substrates were loaded into a sputtering 
chamber for Cu deposition. The chamber was maintained at room 
temperature under an argon pressure of 0.3 mtorr, giving rise to a Cu 
deposition rate of 2nms7 (ref.*°). 


Chemical vapour deposition of hBN 

The monolayer hBN films were grown ina three-inch furnace tube with 
three heating zones using low-pressure chemical vapour deposition 
(LPCVD). Thetwo-inch Cu (111)/sapphire substrate was placed in the cen- 
tral heating zone of the main chamber. Ammonia borane (97%, roughly 
60 mg) was used as the precursor and loaded intoasubchamber at the 
upstream side of the main chamber. The furnace was first pumped down 
toa base pressure of 5.0 torr. Before growth, the substrate was annealed 
at 1,050 °C for 60 min under a hydrogen gas flow of 300 standard cubic 
centimetres per minute (sccm). The subchamber (with precursors) was 
heated to 85 °C using a heating belt and maintained there for 30 min. 
The precursor was then introduced into the main chamber for 30 min 
in order to grow hBN on the Cu (111)/sapphire substrates, with the 
substrates facing downwards. After hBN growth, the subchamber was 
closed, andthe main chamber was naturally cooled to room temperature 
under a hydrogen gas flow of 30 sccm. 


Transfer of hBN films onto arbitrary substrates 

The as-grown monolayer hBN film was detached from the Cu (111)/ 
sapphire substrate by electrochemical delamination. A poly(methyl 
methacrylate) (PMMA) film was first spin-coated on the as-grown hBN/ 
Cu (111)/sapphire as a protection layer. Then a thermal release tape 
(TRT; catalogue number 3195M from Nitto) was applied to the PMMA/ 
hBN/Cu (111)/sapphire in order to avoid possible folding during the 
transfer process. Electrochemical delamination was performed using 
an aqueous solution of NaOH (1M) as the electrolyte, with the Cu layer 
in the TRT/PMMA/hBN/Cu (111)/sapphire stack as the cathode and a 
platinum foil as the anode, under an applied DC voltage of 4 V. During 
this process, the TRT/PMMA/hBN stacked film was detached fromthe 
Cu (111)/sapphire through the generation of hydrogen bubbles at the 
hBN/Cu interface. After detachment, the TRT/PMMA/hBN stacked film 
can be placed onthe target substrate. The TRT can be released by bak- 
ing the TRT/PMMA/hBN/substrate ona hot-plate at 180 °C. The PMMA 
film was finally removed by immersing the sample in hot acetone for 
40 min, leaving behind a monolayer hBN film on the target substrate. 


Chemical vapour deposition of MoS, 

Highly oriented monolayer MoS, was grown on sapphire substrates by 
CVD in a horizontal hot-wall three-inch furnace tube with two heating 
zones. The maximum substrate size can be a wafer of up to two inches. 
High-purity sulfur (99.5%, Alfa) and MoO, (99%, Aldrich) powders were 
usedas the reaction precursors. The sulfur powder was placed in the front 
heating zone at the upstream side of the furnace, and the temperature was 
maintained at 140 °C during the reaction. The MoO, powder was put into 
a quartz boat inthe central heating zone of the furnace. The temperature 
of this central heating zone was gradually ramped to 740 °C and held for 
5 min. During this process, MoS, was grown on the sapphire substrates 
placed at the downstream side of the MoO, quartz boat. All growth was 
performed in argon flowing gas (90 sccm) at a base pressure of 30 torr. 
Finally, the furnace was naturally cooled to room temperature”. 


Transfer of monolayer MoS, 
After CVD growth, monolayer MoS, films on sapphire were 
transferred onto a target substrate using PMMA. PMMA was first 


spin-coated onto the as-grown MoS,/sapphire as a protection 
layer. Then a TRT was applied to the PMMA/MoS,/sapphire in order 
to avoid possible folding during the transfer process. The TRT/ 
PMMA/MoOS,/sapphire stacked film was immersed in an NH,OH 
solution (NH,OH/deionized water, 17/100) at 100 °C for 20 min. 
Next, the detached TRT/PMMA/MoOS, stacked films were immersed 
in deionized water in order to dilute the etchant and residues. The 
TRT/PMMA/MoOS, stacked film is placed on the target substrate; 
the TRT is released by baking on a hot-plate at 180 °C; and the 
PMMA film is finally removed by immersing the sample in hot 
acetone for 40 min, leaving behind a monolayer MoS, film on the 
target substrate”. 


LEEM/PEEM/p-LEED characterizations 

We carried out structural analysis of hBN using low-energy electron 
microscopy (LEEM) with a field-emission gun, photoemission elec- 
tron microscopy (PEEM) excited bya mercury lamp, and microspot 
low-energy electron diffraction (u-LEED, with an area of about 
3 um in diameter). The system (Elmitec LEEM-III) is composed of 
a preparation chamber, a main chamber for imaging, and a deep 
ultraviolet laser'®. The sample was first annealed in the prepara- 
tion chamber in ultrahigh vacuum (1 x 10°’ torr) at 600 °C for6h 
and then transferred to the main chamber for LEEM/PEEM/t-LEED 
analysis. 


STM characterizations 

We used an Omicron ultrahigh vacuum (UHV) variable-temperature 
(VT)-STM system for atomic-scale structural characterization, witha 
base pressure better than 10°” mbar. The hBN/Cu sample was annealed 
at 900K for 5.5h before STM measurements. All of the the STM images 
were captured under a constant-current mode at room temperature. 
The atomic-scale morphology of hBN/Cu was analysed directly by high- 
resolution STM. 


EBSD characterization 

EBSD was performed on aJEOLJSM-7800F PrimeSEM with collection 
accessory (from EDAX) at an accelerating voltage of 20 kV and thesam- 
ple stage tilting at 70°. 


TEM characterization 

We obtained TEM images from transferred hBN/SiO,/Si with an accelera- 
tion voltage of 200 keV using a FEI Tecnai Osiris transmission electron 
microscope. 


AFM characterization 
AFM data were acquired in tapping mode over the scan area, using a 
silicon tip. 


XPS measurements 

XPS spectra were obtained using a Perkin Elmer PHI 5400 system 
equipped with hemispherical analyser with an overall resolution of 
0.05 eV. The energy span and energy scale were calibrated by setting 
the 4f,,. line of gold and the 2p,,, line of copper to 84 eV and 932.67 eV, 
respectively. XPS spectra were measured from the as-grown monolayer 
hBN film on Cu (111)/sapphire. The B 1s and N Is emission peaks con- 
firm the formation of hBN. The binding energies of B 1s and N Is are 
located at 190.4 eV and 398 eV, respectively. The B/N atomic ratio is 
calculated from the integrated intensities of these peaks, yielding a 
ratio of 1/1.03, indicative of a good stoichiometry for the CVD-grown 
monolayer hBN films. 


Raman measurements 

The Raman measurements were carried out on transferred hBN 
film using a 532-nm solid-state laser as the excitation source. Excita- 
tion light with a power of 2.5 mW was focused onto the sample with 


a x100 objective lens (with a numerical aperture of 0.9). The signal 
was collected using the same objective lens, analysed with a 0.75-m 
monochromator and detected with a liquid-nitrogen-cooled charge- 
coupled-device (CCD) camera. The £,, band of hBN is located at 
1,371 cm“ and its FWHM value is 14.5 cm”, confirming that the trans- 
ferred hBN film is monolayer thick with high crystalline quality. 


X-ray diffraction measurements 

X-ray diffraction (XRD; Bruker D8-Discover) 6-26 and 360° azimuthal 
(~) scans were conducted using a Cu Ka radiation source (A=1.54 A). An 
XRD azimuthal g scan of twinned Cu (111) films was operated at y=70.5°, 
@=16.2° and 20 = 43.3°; an XRD azimuthal @ scan of untwinned Cu (111) 
films was operated at y = 70.5°, w = 21.5° and 20 = 43.3°. 


First-principles calculations 

First-principles calculations were carried out using DFT as implemented 
inthe Vienna ab initio simulation package (VASP) within MedeA soft- 
ware”’, We used the projector augmented wave method, the exchange- 
correlation potential described by the Perdew-Burke-Ernzerhof (PBE) 
generalized gradient approximation (GGA)”,, and van der Waals cor- 
rection vdW-DF (optB86b) functionals” in order to calculate the dis- 
tance and binding energy between BN molecules and copper substrates. 
For the plane-to-plane and edge-to-step epitaxy model, a rigid B,N, 
moleculeis physically absorbed onto a6 x 6and a6 x 7,/3 Cu (111) sur- 
face witha thickness of four copper layers, using a2 x 2xlanda2x1x1 
k-grid, respectively. To eliminate spurious interactions resulting from 
the slab model of the supercell, the vacuum thickness is larger than 
17A, the in-plane distance of aB,N, molecule away from another side 
of the step edge is at least 10 A, and the energy cutoff of the plane waves 
is 400 eV. The lattice constant of Cu (111) usedin simulation is a=2.6A 
at a reaction temperature of 1,000 °C, with hBN assuming the same 
lattice constant for simplification. The interlayer distance of Cu (111) 
is 2.05 A. The height of the BN molecule from the Cu surface is optimized 
until the change in the energy and the force reach10*eVand107eVA*, 
respectively. For plane-to-plane epitaxy, the optimized vertical dis- 
tances from the top layer of the Cu surface, d, for six configurations 
ared=1.98 A for N,B,, d=1.77 A for N, By, d=2.03 A for NyB, d=1.98 A 
for N,By, d=1.77 A for Ny,B,, and d= 2.03 A for N,B,. Binding energies 
are defined by —F, = Egenz-cu Epon ~ Ecur Where Egenz-cy is the total energy 
of the hBN flake/Cu (111) system, and Fy, and E,, are the energies of 
the B,N, flake and Cu substrate (with or without a step edge); the cal- 
culated binding energies of six total stacking structures in the presence 
of a step are plotted by varying the in-plane distance, D,, to the step 
edge. The six lowest energetic configurations in the presence of astep 
are located at D; = 2a/./3 for NB, (d=1.99 A), 4a/./3 for N,By, (d=1.77 A), 
1.5a/./3 for N,,B, (d= 2.08 A), 2.5a/./3 for N,B,, (d=1.98 A),1.5a/\/3 for 
Ny By (d=1.84 A), and 2a/./3 for N,B, (d=2.06 A) (Extended Data Fig. 8). 
Note that when D, is less than ./3 a/3 (too close to the step-edge), the 
energy dramatically increases. 

We study here the energy difference between N,B,, (0°) and N,B,, (60°), 
the two lowest-energy structures docking to A-step and B-step edges, 
with different lengths of docking contact. We construct a model of a 
BN stripe composed of aromatic rings docking to the step edge. We 
find that the energy difference between the two configurations N,By, 
(0°) and NB, (60°)—an indicator of selectivity for mono-orientation— 
increases rapidly with the docking contact, and approaches a 6£ of 
roughly 0.78 eV, amplifying the Boltzmann selectivity factor e/*s" 
(witha k,7 of around 0.11 eV at 1,300 K) for acontact length of just five 
to six hexagons to more than 10° (Extended Data Fig. 9a, b). The large 
energy difference means that the step edge plays the role of ‘orientation 
filter’, allowing a predominant phase of BN (a mono-orientation) dur- 
ing growth. Factually, the competing subcritical nuclei must be larger 
and, especially, are probably elongated along the step-edge to allowa 
more energetically favourable contact. Obviously such an energy value 


roughly scales with the contact length, C, and the selectivity factor 
increases very rapidly with size, as exp(C). 

The effect of misfit and misalignment of BN to the step edge can 
be studied by calculating the binding energy of a B,N, molecule at a 
three-hexagon contact length with a small tilt angle along the step 
edge (the aligned structure with a tilt angle of zero corresponds to 
N,B,, and N,B,). The lattice constants of Cu (111) and hBN are allowed 
to be different as dq, = 2.6 A and djpy=2.5A, with a3.8% lattice misfit. 
We find that, although the total binding energy is weakened because 
of lattice misfit (that is, there is a weaker interaction of plane-to-plane 
epitaxy), the energy difference between the 0° and 60° orientations 
remains nearly the same (from 0.3 eV to 0.27 eV). The calculated binding 
weakens as the tilted angle becomes larger, indicating that the most 
stable configuration during the initial nucleation growth occurs when 
dockingis tight and well aligned to the step edge (Extended Data Fig. 9c, 
d). Theresult suggests that mono-orientated hBN was epitaxially grown 
on Cu (111) ina manner that was guided mostly by step edges, and that 
the moiré was formed to release the strain between Cu (111) and hBN 
(lattice mismatch or surface topography) after alarge area of film had 
grown. We note that the cooling step after large-area hBN growth will 
result in strains that might be released by local straining of hBN and 
formation of a moiré pattern. Therefore, we believe that moiré does 
not actually affect the growth process, but occurs afterwards. 


Data availability 


All data needed to evaluate our conclusions are found in the main text 
and the Extended Data. Further data related to the paper are available 
from the corresponding authors on reasonable request. 
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Extended Data Fig. 1| Analysis of Cu (111) crystal orientation onc-sapphire 
substrates. a, EBSD inverse pole figure (IPF) mapping (Imm x 1mm) ofthe Cu 
substrate annealed at 1,000 °C for 1h. The normal direction (ND), transverse 
direction (TD) and rolling direction (RD) mappings, as indicated by the 
triangular colour map, show that Cu (111) is polycrystal. The line scan of 
misorientation onthe RD map indicates an in-plane 60° rotation. b, IPF map of 
the Cu substrate annealed at 1,050 °C for 1h; the film is characterized as single- 
crystal Cu (111), and no twinned grain is founded.c,d, XRD 6-26scans of the 
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Cu (111)/c-sapphire substrate annealed at 1,000 °C for 1h (c) and annealed at 
1,050 °C for 1h (d), revealing a Cu (111) peak at 20=43.3°. e, f, XRD gscans of the 
Cu (111)/c-sapphire substrate annealed at 1,000 °C for 1h with an in-plane 
rotation of 60° (e) and annealed at 1,050 °C for 1h with an in-plane rotation of 
120° (f). Note that the sample annealed at 1,050 °C shows the signature ofa 
single crystal without in-plane misorientation, because an hBN triangle has C, 
symmetry and issymmetric after a120° rotation. 


Extended Data Fig. 2 | Single-oriented hBN flakes in different Cu grains. 

a, Optical micrograph of hBN molecules (triangles) grown on different Cu (111) 
grains (the black dashed line indicates the boundary between adjacent twin 
grains). Oppositely oriented hBN flakes are marked by red and blue circles. 
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b, EBSD IPF maps of the areashown ina. The misorientation-of-line scan 
indicates that the twin grain is rotated by 60° (grainato grainb). The RD map 
clearly shows the difference in in-plane orientation between grains aand b. 
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Extended Data Fig. 3 | Statistical analysis of the orientation distribution of circles. b, Statistical analysis of the optical micrograph froma; more than 99.6% 
triangular hBN flakes.a, Optical micrgraph of hBN grownonaCu (111)/ of the hBN flakes are aligned in one direction on Cu (111). 
c-sapphire substrate at 1,050 °C. Misaligned hBN flakes are marked by red 
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Extended Data Fig. 4| STM images from randomly selected locations. a—h, Images of a1.5 x 1.5 cm? hBN film grown on Cu (111)/c-sapphire, where the angle (6) 
between the hBN lattice orientation (black arrow) and the horizontal line (black dashed line) in each image is 26.5° + 1° indicating that the hBN isa single-crystal 
film. 
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Extended Data Fig. 5 | STM characterization of the atomic structure of 
monolayer hBN on Cu (111). a, Large-scale STM image (V,;, =-1.008 V; 

Ip =3.90 nA; T=300 K) of hBN/Cu. b, Atomic-scale STM image (V,,, =-0.003 V; 
hip = 46.50 nA; T=300 K) of hBN/Cu. c, Typical STM image (V,;, = -0.003 V; 

Ip =8.91nA; T= 300K) of hBN on Cu (111) witha relative rotation angle, 6, of 
approximately 3.3°, showing a moiré pattern witha period of 4.20 nm. 

d, Typical STM image (V,;, =-0.032 V; |, =18.51 nA; T=300 K) of hBN on Cu (111) 
witha 6 of roughly 1.5°, showing a moiré pattern with a period of 7.75 nm. The 
unit cell of the moiré pattern is highlighted by a black rhombus. e, Magnified 
STM image (V,,, =-0.039 V; /,;)=18.51 nA; T=300 K) of hBN on Cu (111) witha 0 of 
approximately 1.5°. f, Simulation of the moiré pattern for monolayer hBNon 

Cu (111) with a 6 of roughly 1.5°. The unit cell of the moiré pattern for hBN/ 

Cu (111) is highlighted by a black rhombus. The large-scale STM image ina 
showsa large-area flat terrace of hBN/Cu witha clean surface. The atomic-scale 
STM image in breveals ahoneycomb structure with a lattice constant of 
roughly 0.25 nm, which coincides well with the lattice parameters of hBN. 
Notably, in some typical regions of hBN/Cu (111), moiré patterns with different 
periods are observed. For instance, dand eshow amoiré pattern witha period 
of around 7.75 nm, and cshows another with a period of roughly 4.20nm. 

Such patterns arise from the lattice mismatch and/or relative rotation 
between hBN and the underlying Cu (111) substrate, and the moiré periods (D) 
correlate with the relative rotation angles (9) between hBN and Cu (111) as”> 
D=(1+6)a/2(1+6)(1- cos) + 6, where 6is the lattice mismatch (roughly 2%) 
between hBN and the Cu (111) lattice’, and ais the lattice constant of hBN. 


Consequently, the 6 for the moiré pattern with aD of around 7.75 nm (d,e) is 
calculated to be around 1.5°, and the simulated moiré pattern generated from 
monolayer hBN stacking on Cu (111) witha 6 of roughly 1.5° (f) fits well with the 
STM result (d, e). The 6 for the moiré pattern with a D of around 4.20 nmis 
calculated to be roughly 3.3° (c). g-j, STM images showing the boundary 
between areas with and without moiré pattern. g, Typical STM image 

(Viip =-0.003 V; Ip = 8.10 nA; T=300 K) of hBN/Cu at the boundary site. 

h, Magnified STM image (V,,, = -0.003 V; /;;,= 8.10 nA; T=300 K) of the 
boundary ing (highlighted by the black square), showing that the hBN lattices 
present perfect coherence at the boundary site. i, Magnified STM image 

(Viip = -0.003 V; hip = 17.93 nA; T=300 K) of hBN/Cu without moiré pattern in 
region 1 of g.j, Magnified STM image (V,,, = -0.003 V; /;;,=10.78 nA; T=300K) 
of hBN/Cu with moiré patternin region 2 of g. The hBN atomic rows in region1 
and region 2 are along the same direction (black arrows). The STM image in g 
was captured ata typical boundary region, with moiré (region 2) and non-moiré 
(region 1) areas. The magnified atomic-resolution image at the boundary site 
inh shows that the hBN presents perfect lattice coherence at the patching 
boundary. The hBN atomic rows in the two adjacent regions are along the 
same direction (i,j). Allimages suggest that the hBN is aligned well and has 
mono-orientation, indicating epitaxial growth of hBN on Cu (111), and that the 
formation of moiré pattern does not affect the hBN orientation. We believe that 
the hBNcompletesits single-crystal growth at high temperatures, and that the 
strain associated with sample cooling after growth results in local moiré 
pattern. 
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Extended Data Fig. 6| Optical micrograph of hBN grown on Cu (111)/ hBN flakes grownat 995 °C and 1,010 °C are easily damaged after oxidation at 
c-sapphire at different temperatures. All as-grown hBN/Cu (111) was slowly 150 °Cin air. c,d, All hBN triangles point in the same direction in the same 
oxidized at 150 °C inairin order to enhance the contrast between hBNand Cu. Cu (111) grain after growth at 1,025 °C and1,055 °C, and triangles seem 
Black dashed lines indicate Cu twin grain boundaries. Regions witha red unchanged after oxidation. e, After growth at high temperature (1,070 °C), all 


dashed outline are magnified in lowerimages.a, hBN grownat995°C,withhBN —hBNflakesaligninthe same direction, but show different (stretched) shapes. 
triangles aligned in the same Cu grain. b, hBN grown at 1,010 °C. Notethat the 
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Extended Data Fig. 7 |STM images of hBN/Cu (111). a-d, STM images of hBN/ kinetically nucleates at B-to-A corners while docking to stronger binding sites 
Cu steps without moiré pattern (a—c) and with moiré pattern (d), showing that (B steps) with proper orientation. f, Atomic model of step edges on the top 
step edges are often observed in our Cu (111) crystals. e, Diagram showing that Cu (111) surface, showing A and B steps. 
the meandering steps consist of segments of AandB types, and that BN 
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Extended Data Fig. 8 | Calculated binding energy of sixtypicalB,N;-Cu(111) = as a function of D, (inunits of 3 a/3, where ais the lattice constant of Cu (111)), 
configurations, taking into account docking to top-layer Cu step edges. withthe lowest-energy configurations marked with arrows. Plane-to-plane 


a, Atomic models of six configurations, showing the optimized in-plane epitaxy corresponds to the absence of (or infinity distance away from) the step 
distance, D,, away from A-step (red) or B-step (blue) edges. b, Binding energies 
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Extended Data Fig. 9| Comparison of planar epitaxy and step-edge docking 
of hBN on Cu (111). a, Atomic models of fully docking configurations, N,B, (0°) 
and N,B,, (60°). b, Calculated binding-energy differences (left axis, black 
circles) and corresponding Boltzmann factors (right axis, red squares) between 
N,B,,and N,B,, for stripes of BN on Cu (111) asa function of docking length (in 
number of hexagons) on the top-layer Cu step edge.c, d, Calculated energy of 
misfit B,N,-Cu (111) configurations asa function of small tilted angle along with 
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top-layer Cu step-edge docking. c, Atomic models of B,N,—-Cu (111) 
configurations docking toA steps or Bsteps at atilt angle of 0. The aligned 
structures (@=0°) correspond to N,B, and N,B,, (the two lowest-energy 
structures docking in the vicinity of A-step and B-step edges). The lattice misfit 
between hBN and Cu (111) is 3.8% (Qypy=2-5 A; Acuaiy = 2.6 A). d, Binding varies 
with the tilted angle along the step edge. 
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Extended Data Fig. 10 | Diagrams and photographsillustrating hBNtransfer. During this process, the TRT/PMMA/hBNstacked film is detached fromthe 
a, APMMA film is first spin-coated onthe as-grown hBN/Cu (111)/sapphireasa Cu (111)/sapphire through the generation of hydrogen bubbles at the hBN/Cu 


protection layer. A TRT is then applied to the PMMA/hBN/Cu (111)/sapphirein interface.c, The TRT/PMMA/hBN stacked film is placed on the four-inch SiO,/Si 
order to avoid folding during the transfer process. b, Electrochemical substrate. d, TRT canbe released by baking the TRT/PMMA/hBN/substrate ona 
delaminationis carried out using an aqueous solution of NaOH (1M) as the hot-plate at 180 °C. e, The PMMA film is removed by immersing the sample in 


electrolyte, the Cu layer inthe TRT/PMMA/hBN/Cu (111)/sapphire stack as the hot acetone for 40 min, leaving behind atwo-inch monolayer hBN film onthe 
cathode, and a platinum foil as the anode, with an applied DC voltage of 4 V. four-inch SiO,/Si wafer. 
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Extended Data Fig. 11| FETs built with and without hBN as an interface 
dielectric ina bottom-gate configuration. a—d, Diagrams showing the 
various MoS, FET device structures examined and their transfer characteristics 
(drain current, /,, versus gate voltage, V,) ata driving voltage (V,,) of O.1V. 
Monolayer hBN and MoS, are synthesized by CVD (see Methods). a, The hBN 
film is transferred onto a Si substrate with al10-nm HfO, dielectric layer, and 
topped witha single layer of MoS,. The metal contacts are processed by 
photolithography, and evaporated with Ti (Snm) and Pd (50 nm) using ane-gun 
evaporator. b-d, Top, diagrams showing a typical monolayer MoS, FET on HfO, 
(b),an MoS, monolayer ona polycrystalline (PC) hBN monolayer (c),and an 
MoS, monolayer ona single-crystal (SC) hBN monolayer (d). Bottom, /, versus 
V,. The extracted two-probe electron mobility at room temperature in vacuum 
(mw) is2.9cm?Vs", 6.8cm?Vstand11.8 cm?Vs", respectively. The mobility of 
MoS, improves substantially (by about an order of magnitude) by replacing 
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polycrystalline hBN witha single-crystal hBN. A suppressed current hysteresis 
is also observed when using single-crystal hBN as the buffer layer, and the 
subthreshold swing (SS) also improved from111 mV dec‘ to 76 mV dec”. The 
results indicate that single-crystal hBNcan be used to enhance the electrical 
performance of 2D-based transistors. e, Diagram showing the hBN metal- 
insulator-metal (MIM) structure used to examine the quality of hBN. f,/-V 
curves for MIM tunnel junctions with either a polycrystalline ora single-crystal 
hBN monolayer sandwiched between Pt/Ti (top) and Cu (bottom) electrodes. 
Electrical contacts were fabricated by photolithography and e-gun 
evaporation of Ti(5nm) and Pt (40 nm) to form 100 x 100 pm? pads onas-grown 
hBN/Cu/c-sapphire substrate. The device with single-crystal monolayer hBN 
exhibits a large breakdown voltage (of around 0.1V), whereas the device with 
polycrystalline monolayer hBN shows direct tunnelling characteristics. 
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Large-scale energy storage is becoming increasingly critical to balancing renewable 
energy production and consumption’. Organic redox flow batteries, made from 
inexpensive and sustainable redox-active materials, are promising storage 
technologies that are cheaper and less environmentally hazardous than vanadium- 
based batteries, but they have shorter lifetimes and lower energy density”. Thus, 
fundamental insight at the molecular level is required to improve performance’. 
Here we report two in situ nuclear magnetic resonance (NMR) methods of studying 
redox flow batteries, which are applied to two redox-active electrolytes: 
2,6-dihydroxyanthraquinone (DHAQ) and 4,4’-((9,10-anthraquinone-2,6-diyl)dioxy) 
dibutyrate (DBEAQ). In the first method, we monitor the changes in the'H NMR shift of 
the liquid electrolyte as it flows out of the electrochemical cell. In the second method, 
we observe the changes that occur simultaneously in the positive and negative 
electrodes in the full electrochemical cell. Using the bulk magnetization changes 
(observed via the 'H NMR shift of the water resonance) and the line broadening of the 
'H shifts of the quinone resonances as a function of the state of charge, we measure the 
potential differences of the two single-electron couples, identify and quantify the rate 
of electron transfer between the reduced and oxidized species, and determine the 
extent of electron delocalization of the unpaired spins over the radical anions. 

These NMR techniques enable electrolyte decomposition and battery self-discharge 
to be explored in real time, and show that DHAQ is decomposed electrochemically via 
areaction that can be minimized by limiting the voltage used on charging. We foresee 
applications of these NMR methods in understanding a wide range of redox processes 
in flow and other electrochemical systems. 


Improved characterization methods of studying redox flow batteries 
are needed to enhance the performance and extend the battery life of 
both organic- and inorganic-based chemistries. Ex situ characterization 
can be challenging, owing to the high reactivity, sensitivity to sample 
preparation and short lifetimes of some of the oxidized and/or reduced 
redox-active molecules and ions within the electrolytes. However, one 
of the distinct features of redox flow batteries is the decoupling of their 
energy storage and power generation, which provides different oppor- 
tunities for in situ monitoring. So far, methods such as in situ optical 
spectrophotometry’ and electron paramagnetic resonance (EPR)’ have 
been used to study, for example, crossover of quinones and vanadyl 
ions, but considerable opportunities remain to improve characteriza- 
tion methods to address limitations inherent to each method and to 
probe different phenomena. NMR spectroscopy has previously been 
used to study benzoquinone and polyoxometalate redox reactions inan 
in situ static electrochemical cell*°. Here we proceed astep further by 
using NMR tostudy species in flow via two different methods: probing 


the electrolyte in the flow path (on-line detection), or inthe battery cell 
(operando detection). 


The two in situ NMR setups 

In on-line detection (Fig. 1a and Extended Data Fig. 1a), the battery is 
positioned outside the NMR magnet (300 MHz) and one electrolyte 
solution is pumped through a flow apparatus in the NMR probe, ena- 
bling the study of either the catholyte or the anolyte. The setup requires 
minimum modification of a laboratory-scale flow battery and can be 
easily adapted to other solution NMR instruments and coupled with 
other analytical (flow) characterization methods. For operando detec- 
tion (Fig. 1b and Extended Data Fig. 1b), a miniaturized flow battery cell 
is positioned inside the detection region of the NMR probe, enabling the 
simultaneous study of the catholyte and anolyte in the battery cell. The 
majority of the data presented below are acquired withthe on-line detec- 
tion scheme unless otherwise noted, owing to its higher sensitivity— 
the on-line setup has a larger sampling volume (7.3 cm?) than the 
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Fig. 1| Schematics of thetwoinsitu NMRsetups. a, In the on-line setup, the 
battery comprises 5.0 cm? carbon felt electrodes, with acatholyte and an 
anolyte of potassium ferrocyanide(I1) and AQ, respectively, dissolved in 
1MKOH inD,0O. The volume of the flow path through the magnet including the 
sampling apparatus (7.3 cm’) and excluding the reservoir is 15.0 cm®. At a flow 
rate of 13.6cm? min“, the time of flight of the electrolyte out of and back into 


operando setup (0.032. cm?). Furthermore, on-line detection provides 
superior spectroscopic resolution (Extended Data Fig. 2), because there 
is no interference from the heterogeneous (particularly the metallic) 
battery components, which lead to magnetic field inhomogeneities 
inthe operando setup””. 


Unravelling reaction mechanisms 

On-line NMR measurements were performed for a full cell with 
20 cm? of 100 mM anthraquinone (AQ) and 300 mM potassium 
hexacyanoferrate(II) as the anolyte and catholyte, respectively. On 
charging at aconstant current of 10O mA (20 mA cm”), corresponding 
to the reduction of the AQs, the battery voltage increases from 1.2 Vto 
the cut-off voltage of 1.7 V for DHAQ (Fig. 2a). Only one voltage step was 
observed, whichis consistent with cyclic voltammetry (CV) that reveals 
a single reversible redox peak centred at —0.68 V versus the standard 
hydrogen electrode (SHE; Extended Data Fig. 4a). Despite the single 
peak, a two-step, single-electron process with half-cell potentials of 
E, and E,, has been proposed previously”, defined by the following 
reactions: 


E: AQ? +e >AQ?” (1) 


E,: AQ?" +e7>AQh (2) 


Achemical comproportionation reaction then occurs: 
K,: AQ?” +AQ* = 2AQ?> (3) 


This equilibrium, quantified via the equilibrium constant K,, controls 
the concentration of radicals in the solution throughout the electro- 
chemical reactions; however, we note that no direct spectroscopic 
evidence for radical formation and the complete reduction to AQ* has 
so far been observed. 

Figure 2a presents the 'H NMR spectra of DHAQ as a function of 
electrochemical cycling. Upon charging, the proton signals closest 
to the carbonyl redox centre (Aand C) disappear almost immediately, 
whereas the proton signal farthest from the redox centre (B) broadens 
and moves towards higher chemical shifts. The apparent loss of signals 
A and Cis ascribed to electron delocalization over the semiquinone 
radical anion, which results in substantial line broadening"*”. As charg- 
ing continues, the chemical shift of B reaches a maximum and then 
moves back towards lower values and narrows as the semiquinones 
continue to be further reduced. When the cut-off voltage is reached 
and the potential is held at 1.7 V, proton signals of the final diamagnetic 
product DHAQ* (A”, B” and C”) appear. A similar trend was observed 
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the reservoir is 1.1 min. b, Inthe operando setup, the miniaturized flow cell 
(shown onthe right) consists of flow fields, tubes to flow electrolyte in and out, 
carbonelectrodes, acation-transport membraneand current collectors. The 
volume of the flow pathincluding the cell cavity (0.032cm?) is 7.8 cm?. At a flow 
rate of2.5cm’ min“, the time of flight of the electrolyte out of and back into the 
reservoir is 3.1min. RF, radiofrequency. 


for DBEAQ”, in which the proton signals closest to the carbonyl redox 
centre disappear almost immediately upon charging, whereas the 
proton signals farthest from the redox centre move towards a higher 
chemical shift and then back to lower values until the signals of fully 
reduced DBEAQ* appear (Extended Data Fig. 2d). Galvanostatic cycling 
reveals that these changes are reversible (Extended Data Fig. 5). 

The broadening of proton resonances is related to the electron 
delocalization over the radical anion: the higher the electron density 
onthe proton, the broader the peaks. Figure 2b shows the singly occu- 
pied molecular orbitals for DHAQ*” (determined by density functional 
theory (DFT) calculations), and the hyperfine coupling constants deter- 
mined by EPR at alow concentration (1 mM; Extended Data Fig. 3). 
The magnitudes of the EPR-derived hyperfine coupling constants 
corresponding to proton peaks B’, A’ and C’, are |0.15| MHz (B’) « 
|2.63| MHz (A’) <|4.64| MHz (C’), which areinagreement with the relative 
shifts and line broadenings of the corresponding proton resonances. 
Differences between the hyperfine coupling constants determined by 
DFT and EPR are ascribed to errors inherent to the DFT method and 
the lack of inclusion of, for example, solvent effects (Extended Data 
Fig. 3). Of note, the shift of the water solvent resonance (measured 
here via the HOD signal present in the predominantly D,O solvent) 
mirrors the behaviour of resonance B from DHAQ? (Fig. 2a), where 
the shift is ascribed to bulk magnetic susceptibility effects, which are 
induced by changes in the magnetic susceptibility of the solution. 
The concentration of radicals can be readily estimated from this bulk 
magnetic-susceptibility shift using the Evans method (Supplementary 
Information equations S8-S16), a well established NMR method for 
measuring the magnetic susceptibility of a solution’®. As the fraction 
of radicals is directly related to the comproportionation equilibrium 
constant K. (equation (3); see also Supplementary Information equa- 
tions S1-S7) and the state of charge (SOC), the plot of radical concen- 
tration versus the SOC can then be fitted to the analytical expressions 
of Supplementary Information equations S6 and S7 (see Extended 
Data Fig. 4 for an in-depth explanation) to extract K,. The fit for DHAQ 
is shown in Fig. 2c, yielding a K, of 3.72, corresponding to a potential 
separation (E, — F,) of 33 mV (+10 mV at 293 K; errors are thalf the full- 
width at half-maximum, FWHM) (Supplementary Information equation 
S4), in agreement with our CV-model fitting (Extended Data Fig. 4h). 
Methods section ‘Equilibrium concentrations of DHAQ”, DHAQ*” and 
DHAQ* , and CVfittings’ discusses assumptions and errors associated 
withthe two approaches for deriving F, - £,. Similar results are obtained 
via on-line and operando detection suggesting that the kinetics of the 
system is in equilibrium. In the operando experiment, the shift of the 
water resonances was only quantified (and converted into the radical 
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Fig. 2|In situ pseudo-2D'H NMR spectra acquired during electrochemical 
cycling. a, Online'H spectra of the anolyte obtained froma100 mM DHAQ 
against 300 mM K,Fe(CN), full cell acquired while charging witha current of 
100 mA. The colour scale indicates the intensity of the resonances in arbitrary 
units. The protonresonances are labelled A, Band C for DHAQ, with single- and 
double-prime labels (for example, A’ and A”) indicating the same protonsinthe 
singly and fully reduced anions, respectively. The acquisition time per NMR 
spectrum usedis 75s, and thus each spectrum is asnapshot of the 
electrochemical processes averaged over 2.1% SOC. A plot of the potential 
(black) and current (red) of the cell as a function of time are shown on the right. 
b, Labelling of the protonsand DFT-derived volumetric plot of the singly 


concentration) after there was a clear peak separation between the 
water signals arising from DHAQ and K,Fe(CN), electrolytes (Extended 
Data Fig. 2a). 


Determination of the intermolecular electron-transfer rate 

The rapid loss of proton signals A and C of DHAQ* upon formation of 
fewer than 5% radicals (Fig. 2a) suggests a rapid intermolecular electron- 
transfer process between the diamagnetic and paramagnetic ions, as 
described by the bimolecular reaction shown in Fig. 3a. NMR has been 
previously applied to study such processes“”"®; suitable approxima- 
tions have been derived enabling the electron-transfer rate constants 
to be extracted from the peak broadening (see Supplementary Infor- 
mation and ref. ”). In the slow-exchange regime, the line broadening 
is proportional to the electron-transfer rate constant k,, (Supplemen- 
tary Information equation S24), whereas in the fast-exchange regime, 
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Fig. 3 |NMR analyses of self-exchange electron-transfer reactions. a, Self- 
exchange electron-transfer reactions between oxidized and singly reduced 
AQs. b, Variable-temperature NMR spectra of 5mM DHAQ®" ina DHAQ solution 
of total concentration 100 mM.c, Arrhenius plot of the electron-transfer rate 
constant k,,, calculated on the basis of the FWHM of the DHAQ® peaks A 
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occupied molecular orbital for DHAQ*”. The values of the isotropic 
Fermi-contact hyperfine coupling constants derived from DFT and EPR 
measurements are also shown for each proton. The sign of the hyperfine 
interaction cannot be extracted from the EPR experimental data (given in 
parentheses); see Extended Data Fig. 3.c, Experimentally determined fraction 
of DHAQ?” radicals asa function of SOC determined via the on-line (blue 
triangles) and operando (purple squares) detection methods with a DHAQ 
concentration of 10O mM. The curve obtained from the on-line setup was fitted 
(red stars) using Supplementary Information equations S6 and S7 to extract the 
equilibrium constant defining the concentrations of the radical and 
diamagnetic species concentrations (K,, equation (3)). 


the line broadening is inversely proportional to k,,, (Supplementary 
Information equation S25). As the temperature increases, line broad- 
ening of proton signals in the slow-exchange regime should increase, 
whereas line broadening in the fast-exchange regime should decrease 
or remain constant. 

Variable-temperature ex situ NMR experiments were performed for 
a100 mM DHAQ electrolyte solution containing 5 mM DHAQ*" radicals 
generated by electrochemical reduction (Fig. 3b and Extended Data 
Fig. 6), in which the radical concentration was estimated by assuming 
that itis directly proportional to the applied charge. This assumption 
is valid at the beginning of the reduction/charge, because the concen- 
tration of DHAQ* is small (Extended Data Fig. 4e). As the tempera- 
ture increases from 283.5 K to 313 K, the linewidths of peaks A and C 
increase, consistent with slow exchange. The width of peak B remains 
largely unchanged, suggesting that the exchange is in the fast-exchange 
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H per) Vota ™) Radical concentration (mM) 
(coefficient of determination R?= 98.15%) and C (R?=99.49%). d, Insitu!'H NMR 
spectra and voltage of the cell comprising 100 mM DHAQ and 300 mM 
K,Fe(CN),, obtained witha lowcurrent of 1mAsoas to capture the initial stages 
of charging. e, k., calculated at different radical concentrations and SOC for 
100 mM DHAQand 100 mM DBEAQ. 
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Fig. 4|In situ NMR and mass spectrometry analyses of electrolyte 
decomposition and battery self-discharge. a,'H NMRspectra of 1OO mM 
DHAQ duringa potential hold at 1.2 V and 1.7 V, following charging at 100 mA. 
Greenarrows highlight the DHA® or DHAL® signal. The purple dashed line 
tracks the signal of protons B and B”. The disappearance of signal C” is caused 
by aH-D exchange reaction with the D,O solvent (Supplementary Fig. 2). The 
signal at 8.4 ppm arises from animpurity. b,'H NMRspectra, voltage and 
current of a100 mM 2,6-DHAQin D,O in acharge-rest experiment in N, (flow) 


regime” (see Methods). The FWHM of peaks A and C were then used 
to calculate the electron-transfer rate constant k,, under the slow- 
exchange approximation, in which the line broadening—strictly, the 
transverse relaxation rate—caused by exchange with the paramagnetic 
ions is given by the expression R,, =k,,[P], where [P] isthe radical con- 
centration. As shown in Fig. 3c, the plot of Ink,, versus temperature, 
as calculated from peak C, is linear, whereas that for peak A deviates 
from linear behaviour at elevated temperatures as proton A’—withits 
smaller electron spin density—approaches the intermediate-exchange 
regime”. An activation energy of 0.46 eV for the electron-transfer reac- 
tion between DHAQ” and DHAQ?” is obtained, a value that is larger 
than, for example, the DFT-derived values of 0.28 eV and 0.34 eV for 
electron transfer between sulfonated AQs”°. 

After validating the applicability of the slow-exchange approxi- 
mation, the self-exchange electron-transfer processes are readily 
investigated in situ at room temperature (293 K; Fig. 3d and Extended 
Data Fig. 7). Electron-transfer rate constants k,, of approximately 
1x10°M"'s*and1x10°M"'s* for DHAQ and DBEAQ, respectively—as 
shown in Fig. 3e and Extended Data Fig. 7i—were extracted from the 
broadening of peaks A and C of DHAQ and D, E and F of DBEAQ. The 
changes in the calculated rate constants k,, at the onset of reduction 
are ascribed to the inhomogeneous mixing of the low AQ*” fraction 
(<1%) with AQ* as the electrolytes leave the reactor and flow through 
the tubing into the NMR magnet. Constant (equilibrium) k,, values are 
obtained when the reaction progresses, because the radical anions 


atmosphere. c, Proposed decomposition reaction of DHAQ* .d, Insitu mass 
spectrometry signals of D, (mass-to-charge ratio, m/z=4) during a stepped 
potential-hold experiment (black, potential; red, current). The potential was 
increased from1.2 Vto2.1V, stepping inincrements of 0.1V, holding for 1h at 
each potential step with an interleaved 4 hrest period. 8 cm? of 1OOmMDHAQ 
and 8 cm’ of 300 mMK,Fe(CN), dissolved in D,O are loaded in each electrolyte 
compartment of acustom-made H cell. a.u., arbitrary units. 


become homogeneously distributed, evidenced by the plateau at 
higher radical concentrations. The higher intermolecular electron- 
transfer rate for DBEAQ is probably due to (1) the weaker intermolecular 
Coulombic repulsions between DBEAQ anions than between DHAQ 
anions, because the negative charges are more dispersed on DBEAQ 
than on DHAQ, and (2) the stronger van der Waals and hydrophobic 
interactions between the longer (non-polar) R groups in DBEAQ. The 
values of DHAQ and DBEAQ based on the in situ and ex situ NMR analysis 
areinthe range of the previously reported values?’ ” on other organic 
systems, spanning 10* M“s7?to10"°M‘s7, 


Following electrolyte decomposition and battery self-discharge 
inrealtime 

Thein situ NMR approachenables us to follow electrolyte decomposi- 
tion under specific cycling conditions. For example, for DHAQ, new 
1H NMR signals were observed at 6.5, 6.8, 7.1, 7.7 and 7.9 ppm during a 
potential hold at 1.7 V, performed to ensure the complete reduction 
of DHAQ (Fig. 4a). On the basis of ex situ two-dimensional (2D) NMR 
correlation experiments and DFT-derived chemical shifts (Extended 
Data Fig. 8), these signals are assigned to the degradation products 
anthrone (DHA® ) and anthrol (DHAL*), as identified previously”. 
Subsequent galvanostatic cycling of the solution without a potential- 
hold step did not substantially change the signal intensity of DHA® and 
DHAL* (Extended Data Fig. 8e). By contrast, when the potential hold 
was reduced to1.2 V,amuch smaller concentration of decomposition 
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products was detected (peaks at 6.7, 7.1and 8.1 ppm; Fig. 4a), suggesting 
that the nature and extent of decomposition depends on the reduc- 
tion potential. These observations suggest that the decomposition is 
an electrochemical reaction with a possible route outlined in Fig. 4c. 
A chemical disproportionation reaction with water, as previously 
proposed”, cannot be excluded; however, the voltage dependence 
suggests that the products seen here are formed electrochemically. 
By contrast, for DBEAQ, no decomposition products were detected 
during a potential hold at 1.4 V and 1.7 V (Extended Data Fig. 9), which 
is in agreement both with previous work” and with the proposal that 
degradation of DBEAQ occurs over long-term cycling via a mecha- 
nism involving nucleophilic attack, rather than by degradation of the 
reduced species”. 

The in situ NMR technique can be readily used to monitor battery 
self-discharge. As shown in Fig. 4b, the open-circuit voltage of a charged 
battery (under the protection of flowing N, gas) slowly decreases 
from1.3 Vto1.1V, and then after 17 h asharp decrease to 0.5 Vis seen. 
Thein situ NMR spectra show that this is caused by the reoxidation of 
DHAQ* to DHAQ®” and DHAQ”. The rapid drop in open-circuit volt- 
age occurs when the DHAQ” signals A and B reappear and sharpen, 
indicating that the drop is due to complete oxidation of the solu- 
tion. To identify the potential oxidant, we developed an in situ mass- 
spectrometry technique based on the design of the H cell to moni- 
tor the gas evolution (Fig. 4d). A stepped-potential experiment was 
performed from 1.2 V to 2.1 V, in increments of 0.1 V. D, evolution 
commences at 1.2 V, and it is steadily evolved both at higher potentials 
and during the rest periods. These two observations suggest that D, evo- 
lution originates from achemical reaction, most probably from water 
reduction: DHAQ* + 2D,0 > DHAQ? + D, + 20D-. We note that the 
redox potential of DHAQ is higher than that of the hydrogen evolu- 
tion from water (-0.83 V versus SHE)” ina 1M KOH aqueous solution, 
making this reaction thermodynamically unfavourable (unless 
substantial pH fluctuations occur). We cannot rule out the further 
reaction of the degraded electrolytes, because there is considerable 
literature precedent for reactions of AQ-based structures that involve 
hydrogen evolution—anthrone dimerization, for example, forms 
bianthrone accompanied by hydrogen evolution”. In addition, some 
reported systems are photosensitive, providing another potential 
degradation route”®. Further investigation into the various degradation 
mechanisms is in progress to understand howto control the stability 
of anthrahydroquinones in aqueous media. 

Insummary, we have demonstrated two in situ NMR metrologies to 
study flow batteries. The formation of radicals and fully reduced anions 
is directly observed in two AQ-based redox flow battery systems, in 
which their equilibrium concentrations are governed by the poten- 
tials of the two single-electron-transfer redox processes. The radical 
concentrations as a function of SOC were quantified by analysing the 
bulk magnetic susceptibility changes, enabling the voltage separation 
of the two successive reductions to be extracted. The redox reaction 
was found to be coupled with electron transfer between the radicals 
and diamagnetic anions, with NMR spectroscopy providing amethod 
to measure the rates of these reactions. The presence of self-exchange 
electron-transfer reactions in organic flow batteries has major implica- 
tions because it affects the overall rates of the redox reactions, control- 
ling, for example, the comproportionation rate. Electrochemically 
triggered decomposition of DHAQ* to DHA*/DHAL* was observed 
under specific cycling conditions, but no decomposition of DBEAQ* 
was observed. The real-time observation of reoxidation of DHAQ* and 
hydrogen evolution in these aqueous media indicates that other side 
reactions are occurring that involve solvent water and/or degradation 
of DHAQ*. Owing to the simplicity of the on-line NMR setup—which 
consists, inessence, of alaboratory-scale redox flow battery anda flow 
NMR sampling tube—we expect wide adoption of this technique, which 
will help to advance the understanding of various redox chemistries 
in flow batteries and other electrochemical systems. Beyond battery 
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research, we demonstrate a way to study radical species, particularly 
at high radical concentrations when hyperfine coupling features inan 
EPR spectrum are lost owing to both electron-spin interactions and 
electron-transfer reactions. Our work shows that by following spectral 
changes in real time, NMR can provide key information concerning 
molecular structure, spin-density distributions and intermolecular 
electron-hopping rates. 
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Methods 


Materials and synthesis 
2,6-dihydroxyanthraquinone (2,6-DHAQ, A89502 technical grade, 
90% purity), potassium hexacyanoferrate(1I) trihydrate (P3289, 
>98.5% purity), D,O (151882, 99.9 atom%) and ethyl 4-bromobutyrate 
(167118, 95% purity) were purchased from Sigma Aldrich Chemicals. 
Anhydrous N,N-dimethylformamide (43465, >99.9% purity), potassium 
ethoxide (14263.30, >95% purity), anhydrous potassium carbonate 
(A16625, =>99.0% purity), isopropanol (20839.366, >99.0% purity) and 
glacial acetic acid (20103.364, 99-100%) were purchased from VWR. 
4,4’-((9,10-anthraquinone-2,6-diyl)dioxy)dibutyric acid (2,6-DBEAQ) 
was synthesized following a previously reported method”. 2,6-DHAQ 
(5.00 g, 20.8 mmol) was dissolved in anhydrous DMF (250 cm?) and 
potassium ethoxide (6.13 g, 72.8 mmol) added to the solution under 
vigorous stirring. The mixture was stirred at room temperature for 
15 min. To this, anhydrous K,CO, (14.40 g, 104.2 mmol) and ethyl 
4-bromobutyrate (21.41 g, 109.8 mmol) were added and the mixture 
solution was then heated to 95 °C for 23 h. The reaction mixture was 
poured into deionized water, and the solid collected by filtration, 
washed with deionized water and dried in a vacuum oven overnight 
to afford diethyl 4,4’-((9,10-anthraquinone-2,6-diyl)dioxy)dibutyrate 
(2,6-DEDBEAQ, 9.20 g, 94.2%), the ester precursor of 2,6-DBEAQ, as 
silver-grey powders. 2,6-DEDBEAQ (1.00 g, 2.13 mmol) was hydrolysed 
ina0.15M KOH water:isopropanol (2:1 v/v) solution (60 cm’) at 60 °C for 
12h. The reaction mixture was poured into deionized water (200 cm*) 
and glacial acetic acid added dropwise to adjust the pH to 4. The solid 
was collected by filtration, washed with deionized water and dried in 
a vacuum oven overnight to afford 2,6-DBEAQ in a quantitative yield. 
The NMR spectra that were subsequently acquired were consistent 
with previously reported spectra®. 


Flow battery assembly 

The hardware of the flow battery was purchased from Scribner Associ- 
ates. Ultrahigh-purity sealed graphite flow plates with serpentine flow 
patterns were used for both electrodes. Each electrode comprised 
4.6 mm or 6.0 mm carbon felt (SGL) with a 5 cm? active area. Nafion 
212 was used as the ion transport membranes. Pretreatment of the 
Nafion 212 membranes was performed by first heating the membrane 
in80 °C deionized water for 20 min and then soaking it in5% hydrogen 
peroxide solution for 35 min. The treated membranes were stored in 
0.1M KOH solution at room temperature. PTFE frames with a thick- 
ness of 3 mm were used to position the electrodes with Viton gaskets 
0.7 mm in thickness oneach side of the frames. The current collectors 
were gold-plated copper plates. Anodized aluminium end plates with 
reactant input/output ports were used; however, it was found that 
the coatings were easily corroded by basic solutions. Direct solution 
contact was avoided by carefully inserting the inlet and outlet tubing 
through the Viton O-ring seals. 

Astock solution of 1M KOH dissolved in D,O was prepared and used 
as the solvent. On the negative side, either 200 mM (in 1.4 M KOH), 
100 mM, 50 mM or 30 mM 2,6-DHAQ was prepared in 20 cm’ of sol- 
vent. The SOC of the battery was calculated by dividing the number 
of electrons that have flowed from the electrochemical cycler by the 
theoretical storage capacity of the quinone anions, assuming a two- 
electron-per-anion redox process. For example, for al100 mM DHAQ of 
90% purity, the theoretical capacity is 96.5 mAh. On the positive side, 
potassium hexacyanoferrate(11) trihydrate was dissolved in 20 cm? of 
solvent to form a300 mM solution; this corresponds to 1.5 times the 
total capacity of the 100 mM 2,6-DHAQ solution forthe same volume of 
electrolyte. When the battery undergoes galvanostatic cycling, 50 mM 
potassium hexacyanoferrate(III) is added to the positive side to ensure 
that both Fe** and Fe” are in excess during battery cycling. 

Custom-made glassware made from Pyrex with gas inlet, outlet, 
liquid inlet and outlet were used as electrolyte reservoirs. Prior to the 


experiments, all solutions were degassed with N, gas rigorously for 
30 min to 1h. The torque applied onthe bolts that tighten the cell was 
found to affect the battery performance. It was optimized at 2N mon 
each bolt. The galvanostatic cycling of the battery was controlled by 
a portable potentiostat (SP-150, BioLogic SAS). 


On-line NMR setup 

A custom-made medium-wall flow-through NMR sampling tube of 14cm 
inlength and 10 mm outer diameter was positioned in a micro-imaging 
probe (Bruker 2.5; Extended Data Fig. 1a). The electrolyte solution 
flows from the bottom to the top of the tube. The inlet and outlet of 
the sampling tube were connected to two 1/16 inch PFA tubes (0.5 mm 
inner diameter) 3 minlength via 1/16 inch to 1/8 inch tube adaptors. The 
PFA tube at the bottom is connected to the outlet of the battery; the 
PFA tube at the top is connected to the inlet of the electrolyte reservoir. 
The electrolyte is pumped through the sampling tube and the flow 
battery which is positioned next to a300 MHz NMR magnet outside 
the 5 gauss (G) line by a peristaltic pump (MasterFlex L/S 07551-20, 
Cole-Parmer; pump head, MasterFlex tubing 77202-60 #14, Chem Bio). 
The volume of the electrolyte inside the NMR sampling tube is 7.3 cm*. 
The volume of the electrolyte in the PFA tubes is 1.2 cm®. The volume 
of the electrolyte that passes through the MasterFlex tube is 6.5 cm’. 
Onthe basis of an inner diameter of 8.16 mm of the sampling tube and 
3 cm of the detection length by the NMR probe, the effective volume 
detected by NMR is 1.57 cm®. Flow rates were measured at different 
rotary speeds of the pump using the same tubing, as shown in Extended 
Data Fig. 1d. Ata flow rate of 13.6 cm’ min“, the electrolyte takes 60 sto 
flow back to the reservoir, so the time lag between the electrochemical 
cycling and the NMR detection is 30s. 

As shown in Extended Data Fig. 1c, pseudo-2D NMR experiments were 
performed by direct excitation with a 90° pulse. The acquisition time 
is 1.5 s.Eachspectrum has anumber of scans, n,=8. After eight scans, a 
time delay d, isintroduced before the next spectrum acquisition starts. 
Witha recycle delay, d,=7s, and d,=7s, the total acquisition time ofa 
spectrumis 75s ((7+1.5) x 8+7=75s). Owing to the high ionic conduc- 
tivity of the basic solution, the 90° pulse width for a proton increases 
from 20 ps for a non-conductive solution to 27 ps at a radiofrequency 
power of 30 W. Pulse calibration was performed as a function of SOC 
and the 90° pulse width remained the same. All spectra were refer- 
enced to the water chemical shift at 4.79 ppm before battery cycling 
started. The spectral widths were 200 or 20 ppm. Peak assignment was 
facilitated by the scalar coupling (/) between protons A andB,/,,, and 
between protons B and C,/,,. The/-coupling interactions among the 
three aromatic protons were H(/,,) = 8.6 Hz and H(/,,) = 2.5 Hz. 


Operando NMR setup 

The in situ cell assembly consists of six key components: the PEEK flow 
field, carbon electrode, an ion-transport membrane, current collectors, 
a PEEK sleeve and PEEK tubes (flow inlet and outlet). Pictures of the 
in situcell are shown in Extended Data Fig. 1b. The diameter of the cell 
assembly is 9.9 mm, which fits into a micro-imaging probe (Bruker 2.5). 
The structures of the catholyte and anolyte compartment are identical 
except for an extended solid part 12.20 cm in length attached to one 
compartment, whichis held inthe NMR probe bya screw cap. The flow 
field inside the electrolyte compartment is 1.80cminlength,0.40cm 
in widthand 500 pmin depth. A layer of carbon electrode (39AA, 80% 
porosity, Sigracet) of 1.80 cm in length, 0.40 cmin width and 280 pmin 
thickness is placed inside the flow field. A current collector made of two 
gold wires of outer diameter 0.5mm and length 16.0 cm passes through 
a hole at the back of the flow field and is in electrical contact with the 
carbon electrode. The other end of the current collector is connected 
to an electrical cable that passes through a d.c. 5 MHz low-pass filter 
at the top of the NMR magnet and connects to the electrochemical 
cycler. A treated Nafion 212 membrane 2.80 cm in length, 0.80 cmin 
width and 50.8 um in thickness is compressed between the catholyte 
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and anolyte compartment by Viton O-rings. The cell assembly is held 
inaPEEK sleeve of outer diameter 0.99 cmand inner diameter 0.89 cm. 
Theelectrolyte inlet and outlet on each compartment are PEEK tubes 
(outer diameter, 1/16 inch) 10 cmin length. They are connected to 
1/16 inch PFA tubes 3 min length via 1/16 inch to 1/16 inch unions that 
pass through the bottom of the NMR probe. The PFA tube is connected 
toa peristaltic pump (MasterFlex L/S 07551-20, Cole-Parmer; pump 
head, MasterFlex tubing 77202-60 #14, Chem Bio), and then to the 
electrolyte reservoir. The total volume of electrolyte in the PFA tubes 
is 1.2 cm*. The volume in the MasterFlex tube on the pump is 6.5 cm’. 
Ata flow rate of 2.5 cm? min“, the electrolyte takes 185 s to flow back to 
the reservoir, and so the time lag between the electrochemical cycling 
and the NMR detectionis 92s. The effective NMR detection volume in 
thein situ cellis 0.032 cm’ (excluding the volume of carbon electrode), 
and therefore the residence time in the detection region of the NMR 
probeis 0.8 s. Pseudo-2D NMR experiments were performed witha 90° 
pulse of 27 p's, an acquisition time of 0.15 s (d,=1s,d,=1s,n,=64). The 
total acquisition time of a spectrum is 75 s ((1+ 0.15) x 64+1=74.6s). 


Insitu relaxation measurements 

The longitudinal (7,) and transverse (7,) relaxation time measurements 
were performed in situ via the on-line setup. A full battery with 20 mL of 
100 mM DHAQ and 40 mL of 200 mM K,Fe(CN), was charged at 100 mA 
to the cutoff potential of 1.7 V. The potential was then held at 1.7 V for 
40 min. A flow rate of 13.6 cm? min” was used. The flow and electro- 
chemical cycling were paused intermittently during charging (every 5 or 
10 min, see Extended Data Fig. 10), and the relaxation measurements 
were performed onastatic solution. The 90° pulse width was calibrated 
before each measurement and was found to bethe same throughout the 
electrochemical cycling. After each relaxation measurement, flow and 
electrochemical cycling were resumed. The relaxation measurement 
at each SOC takes up to 30 min, which is much shorter than the time 
for the reoxidation of DHAQ (17h). There was no noticeable change of 
chemical shift before and after each measurement, suggesting that the 
reduced DHAQ is stable onthe time scale of the relaxation measure- 
ment. Nonetheless, there will be errors arising from the reoxidation 
and decomposition of DHAQ during the measurement, particularly 
at high SOC (as discussed in the main text). The errors are reflected in 
the data fitting and are shown in Extended Data Fig. 10b, c. 

T, relaxation time measurements were performed with an inversion- 
recovery (tlir) pulse sequence comprising 180° and a 90° pulses, with 
an increasing delay time (¢) between the two pulses for each spectrum 
inthe second dimension. Eight to 16 spectra were acquired in the sec- 
ond dimension. A recycle delay of 30 s was applied for the oxidized 
DHAQ. As the 7, time of proton B in DHAQ in the presence of radicals 
is an order of magnitude shorter than that of HOD, two separate meas- 
urements were performed on DHAQ proton B and HOD, with a d, of 
0.5s and 2s used for each molecule. The signal integral /(¢) was plotted 
as a function of tand fitted by /(t) =/,(1- 2e'/"), where /, is the maxi- 
mum peak integral. 

T, relaxation time measurements were performed with a Carr- 
Purcell-Meiboom-Gill pulse sequence comprising a 90° pulse anda 
train of 180° pulses, with a delay time of 1 ms before and after each 180° 
pulse. Eight to 16 spectra were acquired in the second dimension, with 
an increasing number (up to 2,000) of 180° pulses. The signal integral 
i(t) was plotted as a function of the sum of the delay time t and fitted 
with I(t) =Ihe/"2. 

T, relaxation times for the H“, H® and HS protons (corresponding 
to A, Band C, respectively) and HOD were measured to be 1.6 + 0.2, 
1.9+0.2,4.9+ 0.2 and 13 +1s, respectively. We note that these 7, times 
depend on the protonation level of the deuterated solvent, because 
the relaxation is largely driven by the proton-dipolar coupling inter- 
action for the diamagnetic solutions. Shorter 7, relaxation values for 
H4", H®’ and H© on the fully reduced DHAQ* anion were measured to 
be 0.32 + 0.06, 0.34 + 0.07, and 0.22 + 0.08 s, respectively, but these 


were measured in the presence of radicals, thatis, in the in situ experi- 
ments. The errors here represent the 95% confidence level from the 
fit. T,and T, times were then measured as a function of SOC. Extended 
Data Fig. 10a presents the voltage profile of a full battery of 1OO mM 
DHAQ and 200 mM K,Fe(CN), during intermittent charging witha 
current of 100 mA. Extended Data Fig. 10b shows that the measured 
T, and T, values for HOD both decrease rapidly on charging to an SOC 
of 9%: T, decreasing from 13.1s to below 0.7 s, and T, decreasing from 
5.1s to below 0.5 s. The changes are more gradual thereafter, reaching 
a minimum of 0.1s (for both 7, and T,) at 50-70% SOC. Both T, and T, 
then increase to 0.4 s as the battery is charged to its full capacity. The 
T, value for B follows the same trend as that of HOD, decreasing from 
1.9s to below 0.03 s at an SOC of 9% (Extended Data Fig. 10c). 


Effect of flow rate, radical concentration and relaxation times 
on magnetization build-up and line broadening 

Under flow conditions, the build-up of magnetization of the nuclear 
spins is determined by the time that the electrolyte molecules spend 
in the high magnetic field’. Since 57, enables a build-up of 99.3% of 
the maximum thermal polarization, ideally a residence time of the 
electrolyte in the field, t, should be longer than 57,. Tis related to the 
flow rate uv by t= V/u, where Vis the volume of electrolyte in the high 
magnetic field. Setting Vto be the same as the detection volume of 
the NMR probe gives the lower limit of the residence time, because 
the region of high magnetic field extends to a longer length than the 
detection region of the NMR probe. In the on-line setup, the volume 
of the detection region is 1.57 cm’. Aresidence time of 57, of the DHAQ 
protons H4 (7, =1.5s), H® (1.9 s), HS (4.9) and HOD (13 s) requires flow 
rates lower than 12.6, 9.9, 3.8 and 1.4 cm’ min“, respectively, for quan- 
titative measurements of the diamagnetic species in the absence of 
any radicals. In the operando cell, the detection volume is 0.032 cm? 
and a residence time of 57, for H’, H®, H© and HOD gives flow rates of 
0.3, 0.2, 0.1and 0.03 cm? min“, respectively. We note that if faster flow 
rates are required, simple methods for polarizing the nuclear spins 
before the liquids actually enter the radiofrequency coil (for exam- 
ple, by adding loops or liquid reservoirs in the magnet) can be readily 
added to the setup. 

Extended Data Fig. 10d, e shows the'H NMRspectraof DHAQ and HOD 
as a function of flow rate without electrochemically cycling the bat- 
tery inthe on-line setup. The signal integral is plotted against the flow 
rate, as shown in Extended Data Fig. 10f, g. As the flow rate increases, 
the decrease of the water signal is the most pronounced, owing to its 
large value of 7,, followed by proton C of DHAQ. The signal intensity 
of Aand Bis almost unaffected by the flow rates studied here, owing 
to the shorter 7, values of H’ and H®. Of note, when radical species (for 
example DHAQ*" ) are generated, T, will be decreased substantially, 
owing to the nuclei-electron spin interactions, and much higher flow 
rates will be possible without reducing the signal intensity. 

To achieve optimized electrochemical performance of the battery 
system-—that is, a low overpotential—a high flow rate is desirable in 
order to drive the system out of the mass-transport-limited regime. 
Given the detection volumes of 1.57 cm? and 0.032 cm? in the on-line 
and operando setups, respectively, flow rates that allow quantitative 
measurements of DHAQ proton B in the presence of small concentra- 
tions of radicals and allow a residence time of >57, (5 x 0.03 s) corre- 
spond to <628.0 cm? min” and <14.4 cm’ min“, respectively. The flow 
rates in the majority of the on-line and operando NMR experiments 
are set to 13.6 cm’ min“ and 2.5 cm? min“, respectively, which are in 
this quantitative regime. 

During NMR data acquisition, the thermal polarization is converted 
into transverse magnetization bya 90° radiofrequency pulse, andthe 
decay of the transverse magnetization (the T, relaxation time) deter- 
mines the linewidth of the NMR signal. Thereis a finite probability that 
a fraction of the spins with transverse magnetization will leave the 
detection region before the data acquisition finishes, which may cause 


further line broadening. The longer the 7, relaxation timeis, the more 
susceptible the signal is to line broadening from flow. The FWHM of 
the DHAQ and the HOD signals as a function of flow rate are plotted in 
Extended Data Fig. 10h. As the flow rate increases from 13.6 cm? min? 
to 37.5 cm? min“, the HOD FWHM increases slightly, from 6 to 7 Hz, 
whereas the FWHM of the DHAQ signals show negligible change. Inthe 
presence of the DHAQ*” radicals, which are produced during battery 
cycling, T, is decreased and thus the effect of flow on the linewidth of 
the DHAQ proton signals is further reduced. 


Study of H-D exchange by ex situ NMR 

A solution of 100 mM 2,6-DHAQ dissolved in 1 M KOH in D,O was 
prepared and left in an ambient environment for 30 days before taking 
the first NMR spectrum. The same solution was then charged (cor- 
responding to the electrochemical reduction of 2,6-DHAQ) in aredox 
flow battery and an aliquot of 0.5 cm? was extracted. The aliquot was 
transferred to a5 mm thin-walled NMR tube and one-dimensional 
NMR spectra were acquired ona 500 MHzsolution NMR spectrometer. 
For'H NMR, 32 scans were accumulated with a 30° pulse and d, =1s. 
For®C NMR, 1,024 scans were accumulated witha 30° pulse and d,=3s. 


Study of electron-transfer reactions by variable-temperature 

'H NMR experiments 

The solvent of 1M KOH inD,O was degassed by argon gas for 2h. Forthe 
charge-rest experiments, aH cell was assembled inside the glovebox 
(described below).20 cm’ of 1OO mM DHAQ solution was first reduced 
at 10 mA for 30 min (9 mM concentration of radicals) and an aliquot 
of 0.1 cm? was extracted and sealed in a 5 mm thick-walled NMR tube 
with an airtight Young’s tap. Then the DHAQ was oxidized at 1 mA for 
140 min (5 mM of radicals) and an NMR sample was prepared in the 
same way as for the 9 mM radical concentration. 

NMR spectra were acquired on a 500 MHz solution NMR spectro- 
meter using a one-pulse (90°) sequence. For the spectra acquisitions 
of the sample containing 5 mM of radicals, the temperature of the NUR 
probe was ramped up—from 283 K to 313 K in increments of 5 K—and 
then cooled down to 288 K; spectra were acquired at each temperature. 
The heating was carried out in this way to ensure that any reoxidation of 
thesolution did not perturb the measurement. For the spectral acquisi- 
tion of the sample containing 9 mM of radicals, the temperature was 
set at 283 K and then ramped up from 288 K to 338 K inincrements of 
10 K. Because the magnetic field was locked on the water resonance 
at 4.79 ppm and the water resonance is temperature-dependent, the 
shift of peak A of DHAQ was manually set at 7.3 ppm after acquisition 
to allow for ready comparison of the different spectra. 

As shown in the EPR results (Extended Data Fig. 3), the magnitudes 
of the hyperfine coupling of the three protons are 0.15 MHz (B’) 
<« 2.63 MHz (A’) < 4.64 MHz (C’). Owing to the much smaller hyper- 
fine coupling constant, the exchanges between the protons that give 
rise to resonances B and B’, and those between B’ and B” are in the 
fast-exchange regime. (We shall refer to the coalesced resonances as 
B/B’ and B’/B”.) By contrast, the exchanges involving A/A’ (and A’/A”) 
and C/C’ (and C’/C”) are in the slow-exchange regime. This is also shown 
by the variable-temperature NMR experiments shown in Extended 
Data Fig. 6b, c where the linewidth of resonance B/B’ is largely insensi- 
tive tothe changes in the temperature and in the electron-transfer rate. 

To verify the dependence of linewidth on the exchange rate k,,, the 
effect of two-site chemical exchange on the spectra was simulated with 
the program Spinach”. In the simulation, the concentration of radicals 
was set to either 5 mM or 9 mM, thatis, the same as the radical concen- 
tration used inthe variable-temperature experiments. The position of 
resonance B’ of 7.51 ppm was used, which is estimated from the shift of 
Binthe on-line NMR spectra (Fig. 2a) at lowradical concentrations. For 
example, inthe presence of 1.35% radicals (as determined by the shift of 
the water resonance via the Evans method), the shift of Bis 6.46 ppm, 
and the shift of B’ is given by: 


(6.46 - 98.6 x 6.45)/1.35 ppm (4) 


where 6.45 ppmis the shift of resonance B in the absence of radicals. 

To illustrate that the exchange is indeed in the fast-exchange limit, we 
varied the exchange rate constant k,, from 0to10° M's. Coalescence 
of resonances Band B’ occurs at approximately 10* M's ‘anda super- 
position of the spectra obtained with k,, = 0.5 x 10° to 10° M's show 
only negligible changes in linewidth, consistent with our suggestion 
that exchange is indeed in the fast regime for these k,, values. k,,, was 
then set to either 0.50 x 10°, 1.05 x 10° or 3.0 x 10° M's ‘inthe simula- 
tions, which corresponds to the measured values at 283.5, 293.0 and 
313.0 K from the variable-temperature experiment (Fig. 3c; extracted 
from the analysis of resonances A andC). As the temperature increases, 
the FWHM of resonance B/B’ decreases from 16.7 to 13.9 Hz and then 
to 10.7 Hz in the simulations. The experimental spectra (Extended 
Data Fig. 6b) are more complex because they contain/ coupling. How- 
ever, deconvolution of the resonances indicates that the broadening 
decreases from 3.3 to 3.2 Hzand then to 2.9 Hz. Onincreasing the radical 
concentration to 9 mM, the simulated line broadenings decrease from 
29 to19 Hz withthe same exchange rates, whereas experimentally the 
linewidth decreases from 24.1 to 23.5 Hz. Although our simulations 
are in reasonable agreement with experiment, they predict slightly 
larger broadenings than seen experimentally, and larger errors are 
observed with lower concentrations of radicals. This is ascribed to: 
(1) uncertainty in the 7, of resonance B’—decreasing this value from 
500 Hz (the value chosen for the original simulations) to 50 Hz 
decreases the line broadening of resonance B/B’ from 13.9 to 7.7 Hz for 
k.,=1.0 x 10° M7sand5 mM radical concentration; and (2) uncertainty 
inthe shift of B as charging progresses and the pH changes— only very 
small changes will have a substantial effect on the line broadening, 
because the shift difference between B and B’ is so small. To address this 
at least in part, we estimated the shift of B’ at the beginning of charge, 
where pH and bulk magnetic susceptibility effects are probably smaller; 
and (3) the lack of inclusion of the effect of temperature on the shift of 
B’ (and B) and their relaxation times. 


Identification of the DHAQ decomposition products 

The following NMR experiments were performed on an aliquot solu- 
tion of 0.2 cm? taken from the H-cell experiments at 470 h (which was 
the end of acharge-rest experiment), and sealed ina 5mm NMR tube 
(see Methods section ‘H-cell experiments in an argon glovebox’). 


'H diffusion-ordered spectroscopy 

Diffusion-ordered spectroscopy (DOSY) spectra were obtained with 
a 2D sequence for diffusion measurements using a stimulated echo 
anda longitudinal eddy-current decay using bipolar gradient pulses”. 
The diffusion coefficient, D, was calculated with: 


f(g) =Ie v8 & 4-6/3 (5) 


where yis the gyromagnetic ratio of the proton (26,752 rads'G") andg 
is the variable gradient strength ranging from 0 to 2,100 Gcm'with16 
increments of gradient strength. The length of the gradient 6is 1.5 ms 
and the diffusion time A is 100 ms. 


'H homonuclear correlation spectroscopy 

Homontuclear correlation spectroscopy (COSY) with an artefact-free 
PFG-enhanced double quantum filter®° was performed. The second 
dimension was constructed using 192 increments spanning 13 ppm. 
d, was 2s and n, for eachincrement was 2. 


BC distortionless enhancement polarization transfer 

Spectra were acquired with a shaped pulse of 180° on “C, and a 135° 
read pulse and proton decoupling were applied during acquisition. 
The delay between the 90° and 180° pulses was 3.45 ms. This pulse 
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sequence yields positive peaks with one or three protons bonded to 
the carbon atom and negative peaks with two protons bonded to the 
carbonatom. 


1H/“C correlation heteronuclear single quantum coherence 
Experiments were performed via an insensitive nuclei enhanced by 
polarization transfer (INEPT) experiment*! °°, Phase-sensitive acquisi- 
tion with an echo/antiecho-TPPI (time-proportional phase incrementa- 
tion) gradient selection and decoupling was used during acquisition. 
Trim pulses forthe INEPT method with multiplicity editing during the 
selection step as well as shaped pulses for inversion on®C for matched 
sweep adiabatic pulses were also used during acquisition. The second 
dimension (8C) was constructed using 1,024 increments spanning 
190 ppm. d, was 0.8 sand n, for each increment was 4. 


H-cell experiments in an argon glovebox 

Study of electrolyte decomposition and battery self-discharge. 
The solvent was prepared by dissolving 1 M KOH in D,O followed by 
vigorous degassing for 2 h, and was then loaded into a glovebox (oxygen 
level <O.1 ppm), along with parts of the H cell. The oxygen level of the 
solvent was measured to be 0 ppm (0.2 ppm accuracy) by a precision 
dissolved-oxygen meter. 100 mM DHAQ and 300 mM K, Fe(CN), solu- 
tions were prepared in 20 cm? of solvent and were placed in the H cell 
inside the glovebox. The H cell witha sampling port was made in-house 
(Supplementary Fig. 3a). Pretreated Nafion membrane was sandwiched 
between the two half-cells. Three pieces of carbon paper (Sigracet 
39AA) witha dimension of 1cm x 3cm were immersed in the solution 
oneach side and electrically connected toa portable potentiostat (SP- 
150, BioLogic SAS). A charge current of 10 mA was applied to a cut-off 
voltage of 1.9 V and the voltage was continuously measured during 
cellresting for up to 470 h. The solution was stirred vigorously during 
the experiment. NMR, infrared and mass spectrometry analysis were 
performed onthesolution and headspace gas during the charge-rest 
cycling. 


Study of battery self-discharge by infrared spectroscopy 
Attenuated total reflection infrared spectra were acquired witha 
Fourier-transform infrared spectrometer (Cary 630) inside an argon 
glovebox. Thespectral range was from 4,000 to 400 cm“ witha resolu- 
tion of 2cm™. Sixteen background and sample scans were performed. 

During the charge-rest experiment, 10 mm? aliquots of DHAQ solu- 
tion were extracted intermittently from the sampling port of the H cell 
bya syringe. These were then dropped onto the spectrometer sampling 
window to enable infrared spectra to be acquired. 


Ex situ mass spectrometry 

Mass spectrometry was performed using an in-house system connected 
toa quadrupole gas analysis system (Pfeiffer ThermoStar). The analysis 
was performed on 2 cm? of gas sampled from the headspace of the 
same solution inthe H cell after the charge-rest cycling. 5 cm’ syringes 
were used to extract the gas, and the syringes containing the sampled 
gas were transported in an airtight plastic box from the glovebox to 
the mass spectrometer. The transport time was approximately 2 min. 
The gas or solution was injected into an online T-shape glass sampler. 
The carrier gas was argon at a flow rate of 100 ul, s7 (11, 1 litre under 
normal conditions of 101.3 kPa and O °C) at 1.1 bar(a) (absolute pres- 
sure). The dwell time for m/z=4 was 5s. 


Study of gas evolution by in situ mass spectrometry 

An electrochemical H cell was designed and connected to an online 
electrochemical mass spectrometry system (Supplementary Fig. 4a, b). 
The cell was based ontwo 1/2 inch stainless steel tees (Swagelok) inter- 
connected via a liquid-tight glass union with a membrane fitted in the 
middle (Nafion 212). Both sides of the cell were capped at the bottom 
with a round-bottomed glass test tube fitted with a magnetic stirrer 


andat the top witha stainless steel plunger wherea working electrode 
was affixed. All connections were made both liquid- and gas-tight with 
PTFE ferrules. The total internal volume of each compartment was 
about 10 cm®. To sample the headspace (about 1 cm’) the top plunger of 
the anolyte (AQ) compartment of the cell was fitted with two stainless 
steel tubes that were connected to a gas line through double-shut-off 
quick connects (Beswick). The flow of the argon carrier gas was con- 
trolled bya mass-flow controller anda pressure controller (Bronkhorst) 
and set to 2001, s ‘at 1.1 bar(a). After passing through the head space of 
the anolyte, the sample gas was fed to a quadrupole mass spectrometer 
(Pfeiffer Thermostar) through a capillary (inner diameter = 0.22 mm) 
heated to 120 °C to prevent condensation. A potentiostat (Ivium Vertex) 
was connected to both sides of the cell to control the electrochemical 
operations. 


Cyclic voltammetry 

Electrochemical measurements were performed on a potentiostat 
(SP-150, BioLogic SAS) using anin-house small-volume CV cell. Polished 
3mm diameter glassy carbon (A-012744, BioLogic SAS) was used as the 
working electrode, and coiled platinum wire was used as the counter- 
electrode. For the reference, a Hg/HgO (1 M KOH) electrode witha 
potential of 0.14 V versus SHE was used. The 1M KOH solution was made 
under inert atmosphere by the addition of degassed Millipore water 
toaknown quantity of potassium hydroxide. DHAQ (16 mg) or DBEAQ 
(41.2 mg) were dissolved in the KOH solution (using 13.32 cm? and 10 cm? 
of 1M KOH, respectively) under inert atmosphere. A sample of the 
solution was then extracted and added to the nitrogen-flushed small- 
volume electrochemical cell under inert atmosphere. The electrodes 
were then checked to ensure that no bubbles had formed during the 
addition of water before cycling was initiated. A constant overpressure 
of nitrogen was maintained during the experiment. The voltage was 
scanned from0 Vto-1.5V at 20 mVs7. 


Equilibrium concentrations of DHAQ? , DHAQ*” and 

DHAQ*, and CV fittings 

In Extended Data Fig. 4h, we have fitted the first full CV cycle with four 
different approaches. The two 2e processes (approaches 1and 2) are 
an illustrative comparison, whereas the 3rd and 4th approaches (both 
le + le’ processes) are more relevant. In each case, we implemented 
code using the SciPy Python library and its curve-fitting function. As 
the values that were tobe fitted have differing orders of magnitude, the 
x_scale =jac option was used to rescale variables to aid in the fits. The 
diffusion coefficients were assumed to be the same for each species, 
and where relevant so was the electron-transfer rate, Ky. The symme- 
try factor a was set to 0.5. For approach 4, diffusion coefficients and 
electron-transfer rates were not constrained to be the same for all spe- 
cies. Initial guesses for the fits were set to be in line with the reported 
experimental data. 

For approach 3, the fitted voltage difference was averaged over the 
68 cycles analysed here and found to be 60 mV. This is in line with pre- 
viously reported data*. However, these values are twice what we found 
from our comproportionation calculations coupled with the Evans 
method. The average diffusion coefficient is 2.7 x 10° m’ s*. The aver- 
age potentials (versus Hg/HgO (1 MKOH)) were -0.804 V and -0.864 V. 
The k, values had a far larger distribution in our fits, as the average of 
3x10°cms ‘hada standard deviation between 2x 10?and3 x10?cms7. 

Approach4 gave afitted voltage difference of 30 mV, which matches 
very well with the value of 33 mV derived from the Evans method. 
The other fitted parameters are as follows: diffusion coefficients 
were 3.0 x 10°, 1.0 x 10° and 7.8 x 10° m?s? for DHAQ? , DHAQ? 
and DHAQ*, respectively. The potentials were F, = —0.800 V and 
E,=-0.830 V. ky, and ky, (the electron-transfer rate k, for reactions 1 
and 2, respectively) converged to the same value, 7x10 *cms". 

These approaches (Extended Data Fig. 4h) show that it is far more 
probable that the redox reactions are two single-electron steps instead 


of asingle two-electron step. Itis non-trivial to obtain the voltage differ- 
ence £, -£,, and inconsistent results are returned owing to the complex- 
ity of the fitting procedure. Such nonlinear least-squares fitting canbe 
very sensitive to initial conditions and constraints used. Comparing 
these approaches with the comparative simplicity of using the equi- 
librium constant along with the Evans method shows that for F, — F, all 
the fits are reasonable. We believe that the Evans method provides an 
independent method for determining F,— £, that is—at least inthe DHAQ 
case—associated with smaller errors (that is, itis more accurate) than 
the values provided purely through fitting of the CV data. 

The current model does not capture the asymmetry inthe HOD shift 
as a function of SOC, which leads to an asymmetry in the calculated 
DBEAQ?* radical concentrations as a function of SOC (Extended Data 
Fig. 4d). This suggests that either acompeting reactionis present that 
depletes the radical concentration at higher concentrations of DBEAQ* 
or that possibly, changes in the solvation of the ions with SOC also lead 
to changes in the HOD chemical shift. 


EPR experiment 

The X-band EPR experiment was performed ona solution of 100 mM 
and 1mM DHAQ, both reduced to approximately 50% of their SOC, 
using an EPR spectrometer (Bruker EMX). For the 100 mM DHAQ, the 
field was swept from 3,000 to 4,000 G and the microwave frequency 
was 9.865410 GHz. For the 1 mM DHAQ sample, the field was swept 
from 3,463 to 3,563 G and the microwave frequency was 9.865408 GHz. 
For both experiments, the attenuation was 20 dB, the amplitude of 
modulation was 0.1G, the power was 2 mW, and the sweep time was 
20 s. Asampling tube (30 mm? cm“, Bruker AquaX) was used for data 
acquisition. 

The software package EasySpin was used to fit the spectra**. Spin sys- 
tems of one unpaired electron coupled to different numbers of proton 
and deuterium spins were set up, taking into account the H-D exchange 
in deuterated solvent D,O. For the non-deuterated DHAQ*”, aspin sys- 
tem of three protons with two magnetically equivalent spins for each 
proton was modelled. For singly deuterated DHAQ*”, aspin system of 
two protons was set up that had with two magnetically equivalent spins 
for each proton, another proton witha single spin anda deuteron with 
a single spin. For the doubly deuterated DHAQ*”, aspinsystem of two 
protons was set up that had with two magnetically equivalent spins for 
each protonand one deuteron with two magnetically equivalent spins. 
The fitted variables were g-factors, hyperfine coupling constants, the 
fractions of each component and linewidth. The Nelder-Mead simplex 
method was used. In Extended Data Fig. 3b-d, we present the fitted 
EPR spectra using a single component of non-deuterated DHAQ®”, two 
components of non-deuterated and singly deuterated DHAQ?”, and 
two components of non-deuterated and doubly deuterated DHAQ?”. 
Thetwo-component system comprising non-deuterated (92.6%) and 
doubly deuterated (7.4%) DHAQ*” gives the lowest root-mean-square 
deviation, 0.0175. The fit yields a g-factor of 2.0077 and hyperfine cou- 
pling constants of 0.15 MHz, 2.63 MHz and 4.64 MHz for each unique 
proton. The other two fits yield similar results. The difference between 
the g-factors of the 100 mM (2.0036) and 1 mM (2.0077) solutions is 
possibly due to motional effects from the varying viscosity of the solu- 
tion or the magnetic field drift of the instrument. 

To understand the effect of the hyperfine coupling on the NMR 
chemical shift, we need to correlate the paramagnetic component of 
theshift 6to the isotropic (Fermi-contact) hyperfine coupling constant, 
Aiso- By definition, A;,, describes the strength of the electron-nuclear 
spin coupling (that is, the unpaired spin density at the nucleus) in the 
limit of static (non-flipping) electronic spins. However, at finite tem- 
peratures the paramagnetic behaviour of the electrons means that 
they undergo a rapid flipping between the two spin states of a spin-12 
system. In an EPR experiment where the electronic spin transitions 
are observed on the timescale of picoseconds to nanoseconds, this 
results in a coupling which is evidenced (at least in the dilute system) 


by splitting of the resonance by A,,,. However, in NMR experiments, the 
much longer timeframe of the nuclear spin transitions (microseconds) 
results in a decoupling of the electronic transitions from the nuclear 
transitions. The net effect of this is that only the time-averaged elec- 
tronic spinmomentis felt by the nuclear spin, and the strength of this 
coupling determines the observed Fermi-contact NMR shift. This scal- 
ing is typically done by means of the magnetic susceptibility x, which, 
for a Curie-Weiss system, depends inversely on the temperature 7. 
The isotropic component of the total shift, 6,,,, can then be written as 
a sum of the diamagnetic (chemical shift) and paramagnetic (Fermi- 
contact) components: 6,,, = 6c. + gc. The temperature dependence of 
Orc is expressed as (in ppm) 


-4 {|%\(_A 6 
6rc=Aiso lag] x10 (6) 


where y, (y,,) isthe electron (proton) gyromagnetic ratio, his the Planck 
constant, and k, is the Boltzmann constant. The expected shifts deter- 
mined from the DFT calculations and via EPR are shown in Extended 
Data Fig. 3e, along with 4,,,, for the DHAQ*” radical and the correspond- 
ing EPR data. 

Although the EPR investigation of this redox system is still ongoing 
and subject to a separate study, the preliminary results clearly show 
why the !H NMR shift of proton B on reduction is relatively small. The 
disappearance of the 'H NMR resonances of protons A and C onreduc- 
tionis ascribed tothe much larger hyperfine shifts associated with these 
protons (see Fig. 3d). We have not accounted for any pseudocontact 
contributions tothe NMR shifts, because the effective g-factor is very 
close to the free-electron value, suggesting that they are extremely 
small. 


Application considerations in redox flow batteries and beyond 
Organic redox flow batteries based on inexpensive (for example, the 
estimated price of anthraquinone disulfonic acid is currently in the 
range of US$0.90 per kg to US$3.90 per kg for industrial-scale produc- 
tion)® and sustainable redox-active materials are promising storage 
technologies, which are cheaper and have fewer environmental haz- 
ards compared to the more established and mature vanadium-based 
systems; the price of vanadium pentoxide in Europe in 2018 ranged 
between US$19.40 per kilogram to US$63.50 per kilogram (http://www. 
vanadiumprice.com). 

Because of the simplicity of the on-line NMR setup, whichin essence 
consists of a laboratory-scale redox flow battery and a flow NMR 
sampling tube, we expect adoption of this technique to advance the 
understanding of various redox chemistries, such as quinone-based**””, 
carbonyl-nitrogen-based’*®, radical-based*’, polymer-based”, and 
metal-complex-based" redox chemistries in flow and other battery 
systems? °?°?5!_for example, lithium-air batteries that involve organic 
redox shuttles. This technique can be readily coupled with other (flow) 
characterizations, including in situ mass spectrometry, EPR and opti- 
cal methods. In addition to the study of redox chemistry, the on-line 
technique can be exploited to study the rate of electrolyte crossover, 
which would help improve membrane design. The operando design lays 
the foundation for future magnetic resonance imaging experiments to 
monitor flow, andthe electrolyte/solvent distribution in the electrode. 

The Evans method enables the radical concentration to be deter- 
mined from the magnetic susceptibility. This affords a straightforward 
approach to track the SOC of the anolyte and catholyte, providing 
critical information about cell balancing and how it varies with cycle 
life. This is not easy information to determine from full-cell measure- 
ments without the use of a reference electrode. Of note, our results 
also motivate the development of simpler and cheaper methods to 
extract this information by using a magnetometer or a relaxometer. 
We anticipate that the in situ NMR and related metrologies will contrib- 
ute to our fundamental and practical understanding of flow batteries 
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and thus to the development of longer-lasting devices with higher 
energy densities for large-scale energy storage. 


Data availability 


The data that support the findings of this study are available from 
https://doi.org/10.17863/CAM.46274 and the corresponding author 
upon reasonable request. 
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Extended Data Fig. 1| The in situ NMR probe setups and pulsesequence. 

a, Image of the flow-through (on-line) sampling tube positioned inside a micro- 
imaging probe (Bruker 2.5).b, The operando cell assembly positioned inside 
the NMR probe. Inset, magnification of the components of the cell assembly. 
c, Each spectrum is acquired by collecting n, free induction decays (FIDs; that 


is, scans) witha recycle delay ofd,. Asecond time delay, d,, isintroduced 
between each spectrum. The pulse width, p,, fora 90° pulse is 27 ps. The 
acquisition time for each FID (aq) is1.5s. d, Measured flow rates as afunction of 
rotary speed of the pump. 
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Extended Data Fig. 2 | Thein situ pseudo-2D and stacked ‘HNMR spectra 
versus battery voltage and current.a, Operando NMR spectra of 10 cm? of 
100 mM DBEAQ, against 20 cm? of 150 mM potassium ferrocyanide. The FWHM 
of the water signal acquired via on-line detection is 6 Hz, compared tothe 
FWHM of 59 Hz acquired in the operando setup. b, Shift of the DHAQ-H® and 
water resonances (A6) as a function of SOC measured via on-line and operando 
detection. The shifts of the water resonances were only included in the plot 
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after there was clear peak separation between the water signals arising from 
the catholyte and anolyte electrolytes. c, Labelling of the protons, and DFT- 
derived volumetric plots of the singly occupied molecular orbital of DBEAQ*~. 
The values of the isotropic Fermi contact hyperfine coupling constants (in 
MHz) were derived fromthe DFT calculation. d, On-line NMR spectra of 20cm? 
of 100 mM DBEAQ (anolyte), against 20 cm? of 300 mM potassium ferrocyanide 
(catholyte). 
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Extended Data Fig. 3 | EPR spectra of DHAQ*” radical anions. a—d, Spectra of 
100 mM (a) and 1mM (b-d) DHAQ solution reduced to 50% SOC. The 1mM 
solution spectra were fitted witha single component comprising non- 
deuterated DHAQ*” (b), two components comprising non-deuterated and 
singly deuterated DHAQ®” (c),and two components comprising non- 
deuterated and doubly deuterated DHAQ®” (d). The labels used to describethe 
three proton positions are given in the structure of the radical shown in b. 

The residual is the difference between the experimental and fitted spectra. 
At100 mM, an intense EPR signal of DHAQ* was observed at 3,518 G (g-factor, 


2.0036). However, the signal is strongly broadened by acombination of the 
electron dipolar interactions at such high radical concentrations andthe rapid 
electron-transfer reactions. This leads to the loss of the hyperfine coupling 
features”. At 1mM, hyperfine coupling features were resolved. e, Table 
showing DFT- and EPR-derived isotropic hyperfine coupling constants (A;,,) 
and contact shifts (6,,) of the proton resonances. The source of the difference 
between the DFT- and EPR-derived hyperfine coupling constants is probably 
due to solvent (water) coordination, H-D exchange, and/or electron hopping. 
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Extended Data Fig. 4| CV and concentration of radical anions. a, CV of 5mM 
DHAQand 5 mM DBEAQinD,0 with1M KOH. The potential was scanned at 

20 mV s”, the scan starting by sweeping from positive to negative potentials. 
b, Fraction of DHAQ*~ radicals calculated onthe basis of the Evans methodasa 
function of SOC for DHAQ (100 mM) under acharging current of 1OO mA anda 
flow rate of 13.6 cm? min? (measured twice; red and blue), and a charging 
current of 150 mA anda flowrate of 33.3 cm’ min” (green). The figures onthe 
right and left are the same data plotted without and with error bars. The errors 
are+ half the FWHM of the water signal.c, Concentration of DHAQ*” radicals as 
a function of SOC for 100 mM and 200 mM DHAQ.d, Fraction of radicals asa 
function of SOC for 100 mM DHAQ and 100 mM DBEAQ. e, Relative 
concentrations of AQ”, AQ* and AQ* asa function of SOC of the system and 
the equilibrium constant, K,. AQ?,, AQ*” and AQ* are represented by dashed, 
dotted and solidlines, respectively. K,=1.26 and K.=10.35 correspondto 
values derived for DBEAQ and DHAQ, respectively (corresponding to £, - F, 


values of 6 mV and 60 mV)°”. Curves corresponding toK, =0.1land K,=20 are 
also shown, to illustrate the effect using the smaller and larger values 

E,- £,=-58.3 mV and 76.6 mV, respectively. The former negative value 
illustrates the case in which the radical is strongly disfavoured. f, The AQ?” and 
AQ* concentration at 50% SOC, as a function of K.. AQ* and AQ* are the black 
dotted and blue solidlines, respectively. g, Experimentally derived radical 
concentrations as a function of SOC. Supplementary Information equations 
S6 and S7 were used to fit the experimental data, along withanSOClag 
parameter ranging from 8% to 12%, wherexis the fraction of AQ*” foragiven 
number of electrons n (discussed, as is the derivation of the equations, in 

the Supplementary Information). The fit to the data was done in SOC steps of 
1%.h, The CV of a100 mM DHAQ. ‘2e-Rev’ and ‘2e-QRev’ refer to atwo electron 
reversible and two-electron quasi-reversible model, respectively, and the 
‘letle’ curves (with and without fitting constraints) refer to a two-step, single- 
electron, quasi-reversible process. 
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Extended Data Fig. 5 | On-line’H NMR spectraand battery electrochemical 
profile during galvanostatic cycling. a, 20 cm’ of 1OO mM DHAQagainst 
40cm? of 150 mM potassium hexacyanoferrate(11)/50 mM potassium 


13.6 cm? min“ andthe current is150 mA. With d, =15s,aq=1.5sandn,=8, each 
spectrumisa snapshot of the electrochemical processes averaged over 2.9% of 


the SOC. c, 20 cm’ of 100 mM DBEAQ against 40 cm? of 150 mM potassium 
hexacyanoferrate(II1). The flow rate is 33.3 cm? min“ and the currentis150mA. hexacyanoferrate(11)/50 mM potassium hexacyanoferrate(III). The flow rate is 


With d,=20 s, anacquisition time (aq) =1.5s andn,=4, each spectrumisa 13.6 cm? min“ andthe current is100 mA. With d,=7s, aq=1.5sandn,=8,each 


snapshot of the electrochemical processes averaged over 2.5% of the SOC. spectrumisa snapshot of the electrochemical processes averaged over 2.1% of 
b, 20 cm* of 200 mM DHAQ against 40 cm? of 250 mM potassium the SOC. 


hexacyanoferrate(11)/50 mM potassium hexacyanoferrate(III). The flow rate is 
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spectra.b,c, Variable-temperature NMR spectra of peak B with5 mM (b), and 
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details) inan airtight NMR tube. The lines above the COSY and HSQC spectra 
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Extended Data Fig. 10 | Effect of radicals and flow on longitudinal and c, T,relaxation time of the DHAQ proton B/B” signal asa function of SOC. The 


transverse relaxation. a, The voltage profile during intermittent charging at 
100 mA of 20 cm? of 100 mM DHAQagainst 40 cm? of 200 mM K,Fe(CN),. 

The red arrows indicate the times when the flow and electrochemical cycling 
were paused, and the relaxation measurements commenced. After the 
measurements finished, the flow and electrochemical cycling were resumed. 
b, T, (blue) and T, (green) relaxation times of the HOD signal as afunctionofSOC. 


error bars represent the 95% confidence level from the fit.d,e, Proton NMR 
spectra of DHAQ (H‘, Hand H‘; d) and HOD (e) acquired at different flow rates 
via on-line detection. f, g, Signal integral of HOD (f) and DHAQ (g) as afunction 
of flowrate and residence time inthe detection region of the NMR probe. h, The 
FWHM of the HOD and DHAQ signals asa function of flow rate and residence 
time, andthe labelling of protons. 
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® Check for updates 


Despite being only one-atom thick, defect-free graphene is considered to be 
completely impermeable to all gases and liquids! °. This conclusionis based on 


theory® ®and supported by experiments!” that could not detect gas permeation 
through micrometre-size membranes within a detection limit of 10° to 10° atoms 

per second. Here, using small monocrystalline containers tightly sealed with 
graphene, we show that defect-free graphene is impermeable with an accuracy of 
eight to nine orders of magnitude higher than in the previous experiments. We are 
capable of discerning (but did not observe) permeation of just a few helium atoms per 
hour, and this detection limit is also valid for all other gases tested (neon, nitrogen, 
oxygen, argon, krypton and xenon), except for hydrogen. Hydrogen shows noticeable 
permeation, even thoughits molecule is larger than helium and should experiencea 
higher energy barrier. This puzzling observation is attributed to atwo-stage process 
that involves dissociation of molecular hydrogen at catalytically active graphene 
ripples, followed by adsorbed atoms flipping to the other side of the graphene sheet 
with a relatively low activation energy of about 1.0 electronvolt, a value close to that 


previously reported for proton transpor 


t'’*. Our work provides a key reference for 


the impermeability of two-dimensional materials and is important froma 
fundamental perspective and for their potential applications. 


From atheoretical standpoint, monolayer graphene posesa very high 
energy barrier for the penetration of atoms and molecules. Density 
functional theory (DFT) calculations predict that the energy barrier Fis 
at least several electronvolts” °, which should prohibit any gas permea- 
tion under ambient conditions. Indeed, one can estimate that at room 
temperature, it would take longer than the lifetime of the Universe to 
find an atom energetic enough to pierce a defect-free membrane of 
any realistic size. These expectations agree with experiments that have 
reported no detectable gas permeation through mechanically exfoli- 
ated graphene. The highest sensitivity was achieved using micrometre- 
size wells etched in oxidized silicon wafers, which were sealed with 
graphene!”’°. In those measurements, a pressurized gas (for example, 
helium) could permeate along the SiO, layer and gradually fill the micro- 
containers, making so-called nanoballoons. Their consecutive deflation 
in air was monitored using atomic force microscopy (AFM) and it was 
shown that the leakage occurred along only the SiO, surface, within 
minutes but independently of the number of graphene layers used 
for the sealing'. These studies allowed the conclusion that graphene 
membranes are impermeable to all gases, at least with the achieved 
accuracy of 10°-10° atoms per second. This was further corroborated 
by creating individual atomic-scale defects ingraphenenanoballoons, 
which resulted in their relatively fast deflation/inflation and confirmed 
the exceptionally high sensitivity of the method”””. 

The devices used inthis study were micrometre-size containers 
madefrom monocrystals of graphite or hexagonal boron nitride (hBN) 
using electron-beam lithography and dry etching (Fig. 1, Extended Data 


Fig. 1). The containers were sealed with graphene monolayer crystals 
obtained by mechanical exfoliation and transferred ontop of the wells 
using van der Waals assembly (‘Device fabrication’ in Methods). In 
control experiments, bilayer graphene and monolayer molybdenum 
disulfide (MoS,) were used for the sealing (see further below and in 
Methods). The wells were chosen to have aninner diameter dof 0.5 or 
1.0 pm, and their depth h was about 50 nm to minimizethe containers’ 
volume and, therefore, maximize the sensitivity with respect to the 
number of inflowing gas molecules. The depth could not be reduced 
further because van der Waals attraction of graphene to the inner walls 
caused itto sag!" typically by afewtens of nanometres (Fig. 1c, d). The 
wells’ ring-shaped top was typically 1-m wide to provide a sufficiently 
large atomically flat area so that no gas diffusion could occur along 
the resulting ‘atomically tight’ sealing with its clean and atomically 
sharp interface>"*. The monocrystalline walls of our microcontainers 
were alsoimpermeable, as reported previously” and confirmed inthe 
present work using wells with walls of different thicknesses. The rough 
surface outside the wells (due to etching) helped to pinthe membranes, 
preventing their slippage. The atomically tight sealing is the principal 
difference with respect to the previous experimental setup?” that used 
‘leaky’ SiO,. In our design, the only possible route for the gas ingress/ 
escape is through the two-dimensional (2D) membrane. 

The basic principle used for detection of molecular penetration 
through graphene membranes is similar to that introduced in ref. ! 
and illustrated in Fig. la. The described microcontainers were placed 
insidea chosen gas atmosphere (for example, helium) and, ifgraphene 
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Fig.1| The impermeability of graphene to helium. a, Schematic of our 
experimental setup. b, Electron micrograph of one of the studied containers. 
The image was takenata tilt angle of 20° fora better view. The graphene 
membrane is seen tostretch over the outer wall and attach to the dry-etched 
surface outside. c, AFM image of a similar device. The white curve shows the 
profile of the suspended graphene along the well’s diameter. The vertical bars 
indicate the width (about 150 nm) over which such profiles were averaged. 

d, Examples of AFM profiles for the same container after storing it in helium for 


were permeable to it, the partial pressures inside and outside should 
equalize so that the total pressure inside the containers (filled with 
the less permeating air) would increase with time, resulting in gradual 
lifting and eventual bulging of the membranes. Changes in the mem- 
brane position were monitored with AFM. All the containers were first 
checked for any possible damage to their sealing and the absence of 
atomic-scale defects” as described in Methods (‘Experimental pro- 
cedures’). Only the containers that successfully passed the initial tests 
were investigated further. They were placed in helium, initially for afew 
days. After that, the devices were taken out, measured by AFM within 
10 min to detect minute changes in the membrane position 6 (Fig. 1c) 
and quickly placed back for further exposure to helium. To maximize 
ouraccuracy, AFM mapping was carried out using the PeakForce mode 
and, for repeated measurements, scans were taken in the same direc- 
tion over the same area (AFM measurements’ in Methods). For the 
same reason, we minimized the stress imposed by different pressures 
inside and outside the containers by normally keeping the external pres- 
sure P, at 1bar and varying only the partial pressure Pof the tested gas 
(Methods). Furthermore, we avoided containers withd larger thanlm 
because their scans appeared notably noisier (AFM measurements’ in 
Methods). Our experiments were limited to temperatures T< 60 °C 
because, after thermal cycling to higher 7, graphene membranes were 
often destroyed, probably because of strain induced by thermal expan- 
sion/contraction (Extended Data Fig. 2). 

Under exposure to helium, no changes in 6 could be detected, as 
detailed in Fig. 1d—f, Extended Data Fig. 3. The figures show that the 
membrane positions did not change regardless of how long they were 
exposed to helium. For example, Fig. le plots our results for more than 
a dozen containers over an observation period of one month. None of 
the devices showed any discernible changes (A6) in the membranes’ 
original positions, beyond small random fluctuations that did not 
exceed 0.5nminamplitude. Statistical analysis of the Ad values yielded 
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days (colour coded). e, Changes in the maximum deflection point for 14 
containers placed in helium over a one-month period. Different symbols 
denote containers made from graphite (empty symbols) and hBN (solid). The 
orange lines indicate the experimental scatter (full range of observed A6) for 
one of the devices represented by the same colour. f, Permeation rates 
evaluated from the datain e; samesymbol coding. Error bars are standard 
deviation by fitting 6 with the linear time dependence. The grey area indicates 
the overall standard deviation using the data for all the devices. 


astandard deviation of about 1A (Fig. 1e; ‘AFM measurements’ in Meth- 
ods). In control experiments, we carried out the same measurements 
in air and found fluctuations of a similar amplitude (Extended Data 
Fig. 3). For higher applied P, the fluctuations were slightly stronger, 
presumably because the extra pressure caused creep of the membranes 
(Extended Data Fig. 3). 

Forsmall changes Ad inthe membrane position, the number of atoms 
or molecules AN penetrating through the area S is given by”"® 


ams 
AN= cy 5-506 


(1) 
where k,7 is the thermal energy, k, is the Boltzmann constant and 
c= 0.5 is the coefficient that accounts for the membrane’s curved 
profile (‘Evaluation of permeation rates and their accuracy’ in Meth- 
ods). The above accuracy of roughly 1A over one month translates 
into no more than a few atoms entering the microcontainers per 
hour. This accuracy is more than eight orders of magnitude higher 
than that achieved inthe earlier experiments reporting graphene’s 
impermeability’®°, which were in turn a few orders of magnitude 
more sensitive than the detection limit of modern helium leak 
detectors. In terms of the areal permeation rates [= (dAN/d?t)/S, 
our experiments yield an upper bound of about 10° sm” for pos- 
sible helium transparency of defect-free graphene. To put this into 
perspective, monolayer graphene is less permeable than 1-km-thick 
quartz glass. Furthermore, the found limit allows a lower-bound 
estimate for the energy barrier F that graphene presents for helium 
atoms. Using the expression (‘Energy barriers’ in Methods) 


P E 
T= exp] - 2 
.f21mk,T p az| (2) 


we obtain F > 1.2 eV, where the pre-exponential factor describes the 
incident rate of helium atoms and m is their mass. This estimate is 
consistent with the barriers found theoretically” *. Keeping in mind 
that it is hardly possible to improve the accuracy for Aé beyond1A, 
the fluctuations increase for larger d, and observations longer than a 
few months and at considerably higher 7 are impractical, our results 
probably present the sensitivity limit of the nanoballoon method. 

Helium with its small weakly interacting atoms is recognized as the 
most permeating of all gases. Nonetheless, we tested several other 
gases (namely, Ne, N,, O,, Ar, Kr and Xe) and, as expected, found no 
discernible permeation. This places practically the same limit on their 
F.Unexpectedly, monolayer graphene exhibited noticeable transpar- 
ency with respect to molecular hydrogen, H,. We first illustrate this 
observation qualitatively, by showing in Fig. 2a one of our microcon- 
tainers before and after its exposure to hydrogen at 50 °C for 3 d. The 
membrane clearly bulged up, although the same container passed our 
impermeability tests with respect to both helium and argon at the same 
T. This observation is striking because even atomic hydrogen, witha 
diameter smaller than that of helium, is predicted to experience an 
E of 2.6-4.6 eV for monolayer graphene‘ °, leaving aside the fact that 
dissociation of molecular hydrogen requires about 4.5 eV, which makes 
theconcentration of atomic hydrogen negligible. Molecular hydrogen 
is expected’ to have even higher F of more than 10 eV. For such high bar- 
riers, hydrogen permeation is completely forbidden and, according to 
equation (2), it should take billions of years for a single hydrogen atom 
to get inside the container. In another control experiment, we used 
microcontainers sealed with bilayer graphene and monolayer MoS,. 
They exhibited no detectable permeation under multiday exposureto 
molecular hydrogen at 50 °C (Extended Data Fig. 4). 

To quantify the observed hydrogen permeation, we measured 
changes in 6asa function of time for many devices at room temperature 
(295 +2K). They exhibited approximately the same inflation rates within 
scatter of about +15%, as indicated by the dashed lines in Fig. 2b, which 
yields = 2x10'stm~. Note that sucha minute gas influx is far beyond 
the detection limit for microcontainers with SiO, sealing!™°. Further- 
more, working in the regime of small linear-in-time Aé (no bulging as 
in Fig. 2a), we measured hydrogen permeation at different 7. The tem- 
perature dependences followed the Arrhenius law, [ « exp(-£/k,7), 
yielding an activation barrier of 1.0 + 0.1 eV (Fig. 2c). This relatively 
small EF strongly disagrees with the theoretical expectations and, more 
importantly, with the fact that smaller helium atoms did not penetrate 
through the same membranes. 

Trying to understand the origin of the unexpected behaviour, we 
performed two additional sets of experiments. First, we quantified the 
hydrogen permeation rates at different pressures Pand found / « PY? 
(Extended Data Fig. 5). The square-root dependence is characteristic 
of processes involving an equilibrium between adsorbed and desorbed 
constituents of a bipartite gas”, in contrast to the linear dependence 
of equation (2) valid for weakly interacting atoms (‘Energy barriers’ in 
Methods). Second, we measured permeation for hydrogen’s isotope 
deuterium. Within our detection limit, no permeation could be dis- 
cerned, which puts a limit of /< 10° sm” on the deuterium influx 
(‘Isotope effect’ in Methods and Extended Data Fig. 8). 

To understand the reason for the exclusivity of hydrogen among the 
other gases, let us recall the following facts. Locally curved and strained 
graphene surfaces are known experimentally to be chemically reac- 
tive?” and are expected to lower the energy required for dissociation 
of molecular hydrogen”. For alocal protrusion (ripple) with t/D >5% 
(where Cis its height and D the lateral size), the dissociated state with 
two hydrogen adatoms becomes energetically more favourable”””?, 
whereas the energy barrier required to reach this state is also reduced 
to about 1eV (‘Ab initio simulations of graphene’s catalytic activity’ 
in Methods). This catalytic activity of graphene is relevant because 
suspended membranes exhibit extensive nanoscale rippling”*”¢ with 
t/D that can easily exceed 5% for both static’** and dynamic” ripples 
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Fig.2|Hydrogen permeation through defect-free graphene.a, AFMmicrographs 
of thesame microcontainer before (left) and after (right) storing it for3 dinmolecular 
hydrogenat lbar. Tospeed up permeation, the gas was heated to50 °C. Whitecurves 
showthe height profiles along the well’s diameter. Tens of microcontainers were 
tested, showing the sameeffect. The somewhat darker outside region of the well’stop 
appears becauseitis not atomically flat but slightly tapered (our lithography masks 
oftenthinned towards the outside perimeter, allowingsome plasma etching of the 
rim region). b, Time evolution of Aé for 12 different devicesin molecular hydrogenat1 
barat T=295+2K. Theemptyandsolid symbols denote graphite and hBN wells, 
respectively. Blue and orange dashed linesare the best linear fits fortwo of the devices 
(colour coded) to indicate experimental scatter. Inset: representative changes inthe 
AFMprofiles withtime.c, Hydrogen permeation rates at different 7. Symbols are 
experimental dataand the solid curve is the best fit tothe activation behaviour, which 
yieldsE=1.0+0.1eV. Top inset: sameas the main panel, but for P=0.1bar. Error barsare 
standard deviation using six or more devices for each 7. Bottominset: illustration of 
the flipping process in the suggested mechanism of hydrogen permeation. Thegrey 
areasinb, cindicate our detection limit. 
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(Extended Data Fig. 7). Another fact is that monolayer graphene is 
known to be highly permeable to protons, exhibiting an activation 
energy of 1.0 + 0.05 eV, whereas bilayer graphene and monolayer MoS, 
exhibit no detectable proton permeation” ”. This is relevant because 
a hydrogen atom absorbed on graphene shares its electron with the 
conducting surface and is indistinguishable from an adsorbed pro- 
ton. Furthermore, it is also known that deuterons, nuclei of deuterium 
atoms, experience a higher barrier than protons, which drastically 
slows their permeation through monolayer graphene” (‘Isotope effect’ 
in Methods). 

On the basis of the above facts, we propose the following scenario 
for the observed hydrogen permeation. First, molecular hydrogen is 
chemisorbed (adsorbed and dissociated) on graphene ripples, which 
results in sp?-bonded adatoms as illustrated in Extended Data Fig. 6. 
These adatoms then flip to the other side of grapheneina1.0-eV transfer 
process similar to that previously reported for proton transport!” 
(inset of Fig. 2c). The flipped adatoms subsequently desorb from the 
concave surface. This scenario is fully consistent with all the experi- 
mental evidence and, also, explains why the observed permeation 
is limited to hydrogen and monolayer graphene. Indeed, among the 
tested 2D crystals, only the latteris sufficiently transparentto protons. 
Neither bilayer graphene nor monolayer MoS, allow protons through", 
whereas monolayer graphene presents a notably higher barrier for 
heavier deuterons than protons” (see ‘Isotope effect’ in Methods). 

Although our experiments cannot distinguish directly whether itis 
chemisorption or flipping that limits the hydrogen permeation, the 
close match of the measured F with the value reported in ref. ? hints 
that the flipping is likely to be the rate-limiting process. This is also 
supported by the observed isotope effect. Indeed, our DFT calculations 
could not find any influence of zero-point oscillations on hydrogen’s 
dissociation (see ‘Isotope effect’ in Methods). However, the flipping is 
expected to exhibit an isotope shift because zero-point oscillations 
decrease the energy of the initial state in the transfer process”. This 
shift results in a higher effective barrier for deuterium and makes its 
permeation undetectable in our experiments (see ‘Isotope effect’ in 
Methods). The dependence I~ P’? (suggesting a finite coverage of 
graphene with hydrogen) is also consistent with the flipping step being 
the limiting factor. Indeed, it is easier for lighter adatoms to desorb 
from graphene (because of stronger zero-point oscillations), which 
should result in higher coverage of the graphene surface with deute- 
rium. Accordingly, if chemisorption were the limiting step, higher 
permeation rates would be expected for deuterium rather than hydro- 
gen, contrary to our observations. 

To conclude, defect-free graphene should be completely imperme- 
able to all atomic and molecular species at room temperature, but 
ripples, wrinkles and other defects inducing a local curvature are cata- 
lytically active and allow non-negligible permeation of hydrogen. If 
necessary, the latter can be blocked by using bilayer graphene or other 
2D materials such as monolayer MoS,. Our results have implications 
for many observations inthe literature. For example, ripples are likely 
to play an important role in lowering barriers for proton transport 
through 2D membranes", a distinct possibility not considered so far 
theoretically’””’. Similarly, the observations may shed light on the 
intercalation of graphene onsilicon carbide by molecular hydrogen that 
is argued to permeate through defect-free graphene*°*!. The discussed 
processes may also be critical for the interaction of graphene with water 
and hydrocarbons and, more generally, emphasize unexpectedly high 
catalytic activity ofnon-flat graphene, in stark contrast to the extreme 
chemical inertness of its bulk counterpart, graphite. 
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Methods 


Device fabrication 

Our devices were fabricated as shown schematically in Extended Data 
Fig. la. A monocrystal of either graphite (NGS Naturgraphit) or hBN (HQ 
Graphene) with a thickness of at least 150 nm was first mechanically 
exfoliated onto an oxidized silicon wafer that was freshly cleaned in 
an oxygen plasma. The quality of the crystal’s top surface was carefully 
checked for the presence of atomic terraces using dark-field and differ- 
ential-interference-contrast microscopy. These modes allow detection 
of crystal edges, terraces and tears, even for a monolayer thickness. 
Using e-beam lithography, a set of ring-shaped polymer masks with an 
inner diameter d of 0.5 or 1.0 pm was patterned on atomically flat parts 
of the surface (without terraces). Reactive-ion etching was then used to 
remove roughly 50 nm of the exposed areato form micrometre-diameter 
wells (Extended Data Fig. 1b). After the lithography mask was dissolved, 
we annealed the structures in a hydrogen and argon (1:10) atmosphere 
at 400 °C for 6h. Thena relatively large crystal of monolayer graphene 
(also obtained by mechanical exfoliation) was transferred in air ontop 
of the wells using the standard transfer procedures for assembly of 
van der Waals heterostructures”*?. Extended Data Fig. 1b shows an 
optical image of an array of graphene-sealed hBN wells. Closer views 
of such microcontainers are provided in Fig. 1, Extended Data Fig. 2a, 
where one can clearly see graphene membranes draping over the wells. 
For comparison, Extended Data Fig. 2b shows a broken graphene mem- 
brane after our unsuccessful attempt to test it at 80 °C. As the inner walls 
of the containers are not perfectly round (see, for example, Fig. 1b, c), 
graphene membranes sag inside in a slightly asymmetric manner as 
noticeable in some AFM profiles (for example, Fig. 1d). 


Experimental procedures 
After the fabrication, microcontainers were first checked with AFM 
for possible tears, wrinkles and other defects. Only devices with seem- 
ingly perfect sealing were used for further investigation. Those were 
tested further by placing them in a3-bar argon atmosphere overnight. 
Occasionally, we found inflated containers that deflated quickly in 
air. In principle, this could be due to defects””°**, but in most cases we 
could trace the leakage to poor sealing of the microcontainers: either 
thetop surface of the wells was slightly damaged by dry etching sothat 
the rough streaks connected the inner and outer rim edges or small 
wrinkles were present, as retrospectively revealed by dedicated AFM 
analysis and scanning electron microscopy. The devices that passed 
the above tests were placed in a tested gas atmosphere and their pos- 
sible inflation was carefully monitored as described in the main text. 
For gas tests, microcontainers were placed inside a small stainless- 
steel chamber. It was evacuated to approximately 10° mbar and then 
one of the gases under investigation was introduced inside. For studies 
of temperature dependences, the whole vacuum chamber was placed 
inside an oven with controllable T. 


AFM measurements 

To monitor changes in the position of the graphene membranes, we 
employed the PeakForce mapping mode (Dimension FastScan from 
Bruker). The use of this AFM mode was essential to achieve the high- 
est possible accuracy in our measurements of the membrane posi- 
tion. The PeakForce mode minimizes tip-to-sample interactions by 
employing an imaging force that can beas small as about 1nN and has 
little effect on suspended graphene. For comparison, when we tried 
the contact-mode AFM imaging, graphene membranes were found to 
sag down after each scan by as muchas several nanometres, which was 
obviously unacceptable for our purposes. The PeakForce mode also 
allows straightforward analysis of the obtained scans, compared with 
the tapping AFM mode wherea non-negligible pressure induced by the 
tip requires rather involved deconvolution of AFM images)». Further- 
more, to maximize reproducibility between consecutive PeakForce 


scans, they were taken always along the same direction, x, across the 
centre of suspended graphene membranes and averaged over a finite 
width of roughly 150 nm (vertical bars in the shown AFM images). This 
approachalso allowed us to increase accuracy by avoiding changes in 
AFM profiles caused by the slight asymmetry in membranes sagging, 
as pointed out in the Methods section ‘Device fabrication’. The asym- 
metry remains constant during measurements for a given device and 
does not affect our results. The averaged deflection profile 6(x) allowed 
us to detect minor changes Adin a membrane’s lowest position at the 
well’s centre (denoted above as 6) such that 6 = 6(0) + Ad, where 6(0) 
is the initial-in-time position at the well’s centre. 

Extended Data Fig. 3a, b shows the accuracy and reproducibility of 
our measurements of 6(x) and A6. For this dataset, two microcontain- 
ers were measured in air as described above, and ten AFM scans were 
taken at 1-h intervals. Between each scan, the devices were taken out 
of the AFM setup and then placed back to mimic the real measure- 
ment procedures. The figures show that the profiles 6(x) were very 
stable, and the resulting Aé did not exceed about 0.3 nm. The statistical 
uncertainty (standard deviation) for this set of AFM measurements was 
about 0.16nm. To check for longer-term stability, 12 microcontainers 
with d=0.5 and 1.0 pm and different sagging (maximum depth 6(0) 
varied between 5 and 25 nm) were kept in air for more than 20 d. Their 
height profiles were captured at regular intervals, a few days apart. The 
devices also exhibited excellent stability such that Aé did not exceed 
0.3 nm (Extended Data Fig. 3c), ingood agreement with the short-term 
results in Extended Data Fig. 3a. 

After exposing microcontainers to higher helium pressures, we again 
did not observe any discernible changes in 6(x) but random fluctua- 
tions in Ad increased (Extended Data Fig. 3e, f), presumably because of 
additional stresses induced by pressure. Note that, trying to improve 
our measurement accuracy further, we also made and tested micro- 
containers with d>2 um. However, their stability was much worse, with 
Aé exceeding 1nm, probably due to increasing instabilities caused by 
tip-membrane interactions. Data from such wells were not used in 
the reported analysis. 


Evaluation of permeation rates and their accuracy 

Equation (1) can be deduced from the expression derived in ref. ’ as 
follows. The pressure Pinside the container includes two components: 
one is the initial atmospheric pressure of the trapped air (P,) and the 
other is AP, the pressure change induced by molecular permeation. 
For small changes inthe membrane position, APcan be estimated from 
the Hencky solution® as 


AP=K(v)EL(A6)?/a* (3) 


where Fis Young’s modulus, L isthe membrane thickness, ais the radius 
of the container and K(v) is the coefficient that depends on Poisson’s 
ratio v. For graphene®’”, F = 1 TPa, v = 0.16 and K(v) = 3.09. The gas 
volume inside the container is given by V= V, + AV= Sh+cS6, where 
Sand hare the container’s area and depth, respectively, and c= 0.5 
is the numerical coefficient accounting for the curved shape of the 
membrane. Putting the above expressions for Pand Vinto the ideal gas 
law PV=(N,+ AN)K,T (N, is the initial number of air molecules), we find 


AN=(P,AV+ VoAP+ APAV)/kgT (4) 


Substituting the expressions for AV and AP, equation (4) can be 
written as? 


K “ K = 


AN= if er, A6+ (h + c6(0))—7—(A6)? +¢ Kor (00 (5) 


For the known constants and noticing that the largest deflection Aé 
used in our quantitative analysis was only about 4 nm, we find that the 
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second and third terms of equation (5) should not exceed 20% of the 
linear term’s value. Therefore, for the purpose of our analysis, equation 
(5) can be simplified to equation (1). This also agrees with the fact that 
Aé evolved linearly in time, within our experimental scatter (Fig. 2b). 
If contributions of the nonlinear terms were considerable, Aé should 
start evolving nonlinearly as a function of AN and, hence, time. 


Energy barriers 
Helium permeation through the barrier presented by a graphene mem- 
brane can be estimated using 


[= Aexp- i] (6) 
B 


where Fis the energy barrier for incident atoms and A is their attempt 
rate (that is, the number of atoms striking a unit area per second). 
Weakly interacting helium atoms are not adsorbed onto graphene 
and, therefore, the attempt rate is given by*® 
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where u¢= | 8kg7 /tmis the mean speed of helium atoms and mis their 
atomic weight. Combining equations (6) and (7), we obtain equation 
(2). However, ifgas atoms or molecules become adsorbed ona graphene 
surface, like in the case of hydrogen, equation (7) isno longer applica- 
ble, andthe attempt rate depends onan equilibrium density of adsorbed 
species. Under the latter circumstances, the different dependence 
A~«./ P is expected”, in agreement with our results for hydrogen in 
Extended Data Fig. 5. 


Ab initio simulations of graphene’s catalytic activity 
Energy barriers for the dissociation of molecular hydrogen on flat and 
rippled graphene were calculated from first principles using DFT, as 
implemented inthe Viennaab initio package’. The generalized gradi- 
ent approximation* and projected augmented wave were adopted to 
describe the exchange correlation potential and ion-electron interac- 
tions. The kinetic energy cutoff and k-point mesh were set to 500 eV and 
7x7x1, respectively“. A vacuum region of 20 A was used to avoid the 
periodic interaction. The stress force and energy convergence criteria 
were chosen as 0.01eV A“ and 10“ eV, respectively. The van der Waals 
interactions were included in the dissociation process and treated by 
the semi-empirical DFT-D3 method’. A supercell of 8 x 8 graphene 
unit cells was adopted for the simulations, and ripples were character- 
ized by the ratio ¢/D of their height tto the corrugation diameter D (inset 
in Extended Data Fig. 6a). The energy barrier forthe reaction pathway 
was calculated using the climbing-image nudged elastic band method, 
in which the total energies of initial, final and several intermediate states 
during the reaction process were calculated explicitly**. The initial state 
was constructed as follows”. First, we created a corrugated graphene 
supercell with a certain ¢/D by allowing the atomic structure to relax 
under biaxial compression. Next, two hydrogen atoms were attached 
to specified carbon atoms, and the whole system was allowed to relax 
to its ground state, during which the positions of unoccupied carbon 
atoms were fixed to keep the t/D value constant. The relaxed carbon 
structure was then used as the initial configuration and the electron 
distribution was optimized during the reaction process. 

For agivent/D, there are many possible corrugated configurations. 
If we consider high-symmetry configurations, the corrugation centre 
is located either at the top of a carbon atom or between two nearest 
neighbours or at the hexagon centre. To minimize the dissociation 
energy, we relaxed the above three structures of rippled graphene with 
two adsorbed hydrogen atoms and used themas the initial states before 
hydrogenation. The initialized graphene ripple could be allowed to 
relax further before chemical reaction, but we found that this caused 


little effect on the energy barrier. After trying many different con- 
figurations and reaction processes, we found that the dissociation 
energy reached a minimum when two opposed sites ina hexagon were 
hydrogenated (see the insets in Extended Data Fig. 6a). In Extended 
Data Fig. 6a, we show changes in the total energy during the reaction 
process for t/D=7.5%. The dissociation energy barrier isabout1.1eV and 
given by the difference between the initial and highest energy states 
along the reaction pathway. For comparison, the dissociation energy 
of molecular hydrogen in vacuum is about 4.5 eV (ref. °), which shows 
that ripples are highly catalytically active. 

The dissociation energy depends on where in the unit cell hydrogen 
atoms are adsorbed. For example, Extended Data Fig. 6b shows the 
adsorption process for the same ¢/D as in Extended Data Fig. 6a but 
with hydrogen atoms attached to the nearest carbon atoms. In this 
case, the dissociation energy barrier is higher (approximately 2.9 eV). 
Our results for different ripple curvatures ¢/D are plotted in Extended 
Data Fig. 6c. Clearly, the dissociation energy decreases monotonically 
with increasing curvature, and the changes become rather gradual for 
t/D>4%. Note that the critical curvature, at which ripples become ener- 
getically favourable for dissociation of molecular hydrogen (t/D ~2.5%), 
is smaller than t/D ~ 4%, which was reported in the earlier study”. This 
is because of improvements in the simulation method and optimized 
atomistic configurations. 

Although graphene membranes are known to contain numerous 
extrinsic (static) ripples”*”> that have typical t/D ~ 5%, it is instructive 
to find what kind of intrinsic (dynamic) ripples one can expect due to 
thermal fluctuations”. To this end, we performed molecular dynamics 
simulations using the Large-scale Atomic/Molecular Massively Parallel 
Simulator (LAMMPS)* and graphene membranes consisting of 387,200 
atoms. Periodic boundary conditions were usually employedto mimic 
an infinite membrane, but we also performed simulations for finite- 
size membranes (from about 35 to 100 nm in diameter). The lateral 
size of ripples (D) ranged between a few and 10 nm, independently of 
the membrane size. Their typical configurations at different 7 were 
obtained after thermalization in 100,000 steps (0.00025 femtoseconds 
per step) and averaging over 20 of such snapshots (Extended Data 
Fig. 7a). Extended Data Fig. 7b, c showsthe areal density for ripples with 
t/D>4%, which are most catalytically active. One can see that thermal 
fluctuations generate many such ripples that can result in dissociation 
of molecular hydrogen. 

It is not clear whether static or dynamic ripples dominate the 
adsorption-dissociation process for graphene membranes. One 
of the issues limiting a contribution from thermally excited ripples 
could be their relatively short lifetimes. Our simulations show that 
the mean half-life of a ripple, during which its ¢/D drops to half, is of 
the order of femtoseconds. For comparison, permeation of adsorbed 
hydrogen atoms through graphene involves the timescale t, which 
can be estimated from their adsorption energy as £,, = h/t where 
his Planck’s constant. For atomic hydrogen on graphene’, F, , is 
expected to be approximately 0.4-1.0 eV, and this yields t of a few 
femtoseconds. This is of the same order of magnitude as the charac- 
teristic lifetime of ripples. 

Besides static and dynamic ripples, there are strained areas around 
the rim of our microcontainers, which in principle could also contrib- 
ute to the observed hydrogen permeation. However, this scenario is 
ruled out by the experimental fact that the observed permeability was 
proportional to the membrane’s area rather than its circumference. 
Furthermore, permeation rates were the same for devices with dif- 
ferent sagging (varying from about 5 to 40 nm), which led to different 
strain. The above conclusion is also supported by our DFT calculations 
in whichthe chemisorption process was considered for flat graphene 
under strain. Extended Data Fig. 6d shows that the dissociation energy 
remains high (about 3 eV) even for strains as high as about 15%. This 
proves that it is the curvature rather than strain that is important for 
the chemisorption process. 


Isotope effect 

The proposed mechanism of hydrogen permeation involves several 
steps: dissociation of molecular hydrogen on graphene ripples lead- 
ing to afinite coverage of the surface with hydrogen atoms; flipping 
of the atoms across the membrane in a proton-like transfer pro- 
cess; and their recombination and desorption as molecular hydro- 
gen. Only the chemisorption and flipping are expected to involve 
sufficiently high energy barriers as discussed in the main text. To 
determine which of these two key barriers limits the observed 
permeation, we performed experiments using deuterium. To this 
end, ten microcontainers were sealed with monolayer graphene 
and exposed to deuterium at 1 bar at room temperature. Ad was 
monitored asa function of time. In stark contrast to our molecular 
hydrogen experiments, the devices did not show any discernible 
changes in Aé (Extended Data Fig. 8). The absence of deuterium 
permeation was further verified in elevated-7 tests such as those 
in Fig. 2a. Several devices were exposed to deuterium at 50 °C for 
3d, but none exhibited any bulging, in contrast to the molecular 
hydrogen experiments of Fig. 2a. 

To understand theisotope effect, we carried out DFT calculations for 
chemisorption of deuterium on rippled graphene. After including cor- 
rections due to zero-point oscillations, we found little difference inthe 
dissociation energies with respect to molecular hydrogen (Extended 
Data Fig. 6). However, desorption of deuterium from the graphene 
surface should be slower because of the same quantum corrections. 
Hence, surface coverage for deuterium atoms should be higher than 
that for hydrogen. The latter isotope effect is well known for both gra- 
pheneand graphite*”** and implies that, if chemisorption were the rate- 
limiting process, higher permeation rates would have been expected for 
deuterium rather than molecular hydrogen, contrary to our observa- 
tions. The latter conclusion indicates again that the limiting process is 
the proton flipping. Indeed, in this case, one expects another isotope 
effect analogous to that observed inthe transport experiments of refs. 
249 where deuterons exhibited ten times lower conductivity through 
graphene than protons. The reduction factor R has been attributed to 
the fact that deuterons have alower energy in their initial bound state 
in Nafion””, which results in an effectively higher barrier for flipping 
(by AF ~ 60 meV). This energy shift leads to the ratio between hydrogen 
and deuterium permeation, whichis given by R = exp(AE/k,7) ~ 10 (ref. 
?). For the case of atomic hydrogen/deuterium adsorbed on graphite, 
AE was found‘ to be around 90 meV. Assuming the same value for 
graphene, this shift should resultin about 35 times slower permeation 
of deuterons with respect to protons. Because of the detection limit 
of about 10° sm”, we can only conclude from our experiments that 
R>20,in good agreement withthe above expectations. This supports 
the proposed scenario in which the flipping step limits hydrogen per- 
meation through monolayer graphene. 
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Extended Data Fig. 1| Device fabrication. a, GraphiteorhBN monocrystalsare tosealthe wells. b, Optical micrograph ofa set of hBN microcontainers. The 
obtained by mechanical exfoliation. Micrometre-size wells are then made by dashed curveindicates the position of monolayer graphene. 
e-beam lithography and ion etching. Monolayer grapheneis transferred ontop 


Extended Data Fig. 2 | Electron micrographs of our microcontainers. a, A microcontainer with d=0.5 um. Suchimageswere taken only after finishing 
measurements to avoid electron-beam damage. b, Example ofa broken graphene seal: the membrane was damaged after athermal cycle to 80 °C. 
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Extended Data Fig. 3 | Stability of graphene membranes inair and helium. different microcontainers made from graphite (empty symbols) and hBN 
a, Changes in 6(0) measured for two containers withd=1 tm; AFM scans were (solid). The dashed lines inc, eindicate maximum changes detected for 
taken every hour. b, Representative profiles 6(x) for a.c, Long-term variations representative devices (colour coded). Ind, f, the colour represents the same- 
in 6for12 different containers kept in air. d, Permeation rates evaluated from colour device as inc, e, respectively. Error bars are standard deviation for 


the evolution of Aéwithtimeinc.e, Ad for 16 different devices placed in helium fitting 6with a linear time dependence. Grey areas are the overall statistical 
at 3 bar. f, Permeation rates for the data ine. Inc, e, different symbols denote accuracy obtained using all our devices measured in air and helium at 3 bar. 
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Extended Data Fig. 4 | Bilayer grapheneand monolayer MoS, are 
impermeable to hydrogen.a, AFM micrographs of the same container sealed 
with bilayer graphene before (left) and after (right) its exposureto molecular 
hydrogenat1bar at 50 °C for3 d. White curves show the profiles along the 
membrane’s diameter. No changes in membrane positions could be detected 
within our experimental accuracy. b, Same experiment for monolayer MoS,. No 
changes could be noticed either. c, For comparison, we show the simultaneous 


6=-29.0nm 


+8.6nm 


experiment for a microcontainer covered with monolayer graphene. The 
membrane clearly bulged out after the exposure, similar to the case of Fig. 2a. 
Allscale bars, lum. After the experiment, the bulging membrane inc was kept 
under ambient conditions and found to slowly deflate over months, in 
agreement withthe room-temperature permeation rates reported inthe main 
text. 
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Extended Data Fig. 5 | Pressure dependence of hydrogen permeation. 
Symbols are measurements at room temperature. Error bars are standard 
deviation using a minimum of ten devices in each case. Solid curve is the best fit 
tothe square-root dependence. 
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Extended Data Fig. 6| Dissociation of molecular hydrogen at graphene function of ripples’ curvature. Insets show top view of the bridge and central 
ripples. a, b, Reaction of molecular hydrogen with graphene for t/D = 7.5% if positions for hydrogen adatoms. d, The dissociation energy barrier asa 
adatoms are adsorbed in the central (a) and bridge (b) positions. Insets: atomic function of biaxial strain. The inset shows schematic showing the direction of 
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Extended Data Fig. 7 | Intrinsic (dynamic) ripples in graphene at different 
temperatures. a, Typical snapshot of graphene membraneat 300 K using 
molecular dynamics simulations. b, Density of ripples with t/D = 7% (most 
chemically active). Symbols are the calculations for different T. Solid curve is a 
guide tothe eye.c, Statistical distribution ofintrinsic ripples with different t/D. 
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Extended Data Fig. 8 | Isotope effect. Time evolution of Aé for ten different 
devices (different colours) exposed to deuterium at 1bar at room temperature 
(295 +2K). Allthe devices are hBN containers sealed with monolayer graphene. 
The dashed lines indicate maximum changes detected for the device coded 
withthe same colour. The random fluctuations are closein amplitude to those 


shownin Fig. le for helium. 
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The Greenland Ice Sheet has been a major contributor to global sea-level rise in recent 
decades’”, and it is expected to continueto be so*. Although increases in glacier flow* ® 
and surface melting’ ’ have been driven by oceanic’° ” and atmospheric’ warming, 
the magnitude and trajectory of the ice sheet’s mass imbalance remain uncertain. Here 
we compare and combine 26 individual satellite measurements of changes in the ice 
sheet’s volume, flow and gravitational potential to produce areconciled estimate of its 
mass balance. The ice sheet was close to a state of balance in the 1990s, but annual 
losses have risen since then, peaking at 345 + 66 billion tonnes per year in 2011. Inall, 
Greenland lost 3,902 + 342 billion tonnes of ice between 1992 and 2018, causing the 
mean sea level to rise by 10.8 + 0.9 millimetres. Using three regional climate models, we 
show that the reduced surface mass balance has driven 1,964 + 565 billion tonnes (50.3 
per cent) of the ice loss owing to increased meltwater runoff. The remaining 1,938 + 541 
billion tonnes (49.7 per cent) ofice loss was due to increased glacier dynamical 
imbalance, which rose from 46 + 37 billion tonnes per year in the 1990s to 87 + 25 billion 
tonnes per year since then. The total rate of ice loss slowed to 222 +30 billion tonnes 
per year between 2013 and 2017, on average, as atmospheric circulation favoured 
cooler conditions” and ocean temperatures fell at the terminus of Jakobshavn Isbree”. 
Cumulative ice losses from Greenland as a whole have been close to the rates predicted 
by the Intergovernmental Panel on Climate Change for their high-end climate warming 
scenario”, which forecast an additional 70 to 130 millimetres of global sea-level rise by 
2100 compared with their central estimate. 


The Greenland Ice Sheet holds enough water to raise mean global sea 
level by 7.4 m (ref. !8). Its ice flows to the oceans through a network of 
glaciers and ice streams”, each with a substantial inland catchment”. 
Fluctuations in the mass of the Greenland Ice Sheet occur due to vari- 
ations in snow accumulation, meltwater runoff, ocean-driven melt- 
ing and iceberg calving. There have been marked increases in air”! 
and ocean” temperatures and reductions in summer cloud cover” 
around Greenland in the past few decades. These changes have pro- 
duced increases in surface runoff®, supraglacial lake formation” and 
drainage”, iceberg calving”, glacier terminus retreat”, submarine 
melting!" and ice flow’, leading to widespread changes in the surface 
elevation—particularly near the margin of the ice sheet (Fig. 1). 

Over recent decades, ice losses from Greenland have made a sub- 
stantial contribution to global sea-level rise”, and model projections 
suggest that this imbalance will continue in a warming climate’. Since 
the early 1990s there have been comprehensive satellite observations 
of changing ice sheet velocity*®, elevation”’ ”’ and, between 2002 and 
2016, its changing gravitational attraction*””, from which complete 
estimates of Greenland Ice Sheet mass balance are determined‘. Before 
the 1990s, only partial surveys of the ice sheet elevation” and veloc- 
ity? change are available. In combination with models of surface mass 
balance (SMB; the net difference between precipitation, sublimation 
and meltwater runoff) and glacial isostatic adjustment™, satellite 


measurements, reported by the 2012 Ice Sheet Mass Balance Inter- 
comparison Exercise (IMBIE)!, have shown a fivefold increase in the 
rate of ice loss from Greenland overall, rising from 51+ 65 Gt yr ‘inthe 
early 1990s to 263 + 30 Gt yr“ between 2005 and 2010. This ice loss 
has been driven by changes in SMB” and ice dynamics*”. There was, 
however, a marked reduction in ice loss between 2013 and 2018, asa 
consequence of cooler atmospheric conditions and increased pre- 
cipitation’. Although the broad pattern of change across Greenland 
(Fig. 1) is one of ice loss, there is considerable variability; for example, 
during the 2000s just four glaciers were responsible for half of the total 
ice loss due to increased discharge’, whereas many others contribute 
today*®. Moreover, some neighbouring ice streams have been observed 
to speed up over this period while others slowed down®, suggesting 
diverse reasons for the changes that have taken place—including their 
geometrical configuration and basal conditions, as well as the forcing 
they have experienced*®. In this study we combine satellite altimetry, 
gravimetry and ice velocity measurements to produce a reconciled 
estimate of the Greenland Ice Sheet mass balance between 1992 and 
2018, we evaluate the impact of changes in SMB and uncertainty in 
glacial isostatic adjustment and we partition the ice sheet mass loss 
into signals associated with surface mass balance and ice dynamics. In 
doing so, we extend a previous assessment’ to include more satellite 
and ancillary data and to cover the period since 2012. 


*A list of participants and their affiliations appears at the end of the paper 
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Fig. 1| Greenland Ice Sheet elevation change. Rate of elevation change of the Greenland Ice Sheet determined from ERS, ENVISAT and CryoSat-2 satellite radar 
altimetry (top row) and from the HIRHAMS SMB model (ice equivalent; bottom row) over successive 5-yr epochs. Data from ref.”’. 


Data and methods 


We use 26 estimates of ice sheet mass balance derived from satellite 
altimetry (9 datasets), satellite gravimetry (14 datasets) and the input- 
output method (3 datasets) to assess changes in the Greenland Ice 
Sheet mass balance. The satellite data were computed using common 
spatial?°*” and temporal domains, and a range of models to estimate 
signals associated with changes in SMB and glacial isostatic adjust- 
ment. Satellite altimetry provides direct measurements of changing 
ice sheet surface elevation recorded at orbit crossing points”, along 
repeated ground tracks” or using plane-fit solutions”®. The ice sheet 
mass balance is estimated from these measurements either by pre- 
scribing the density of the elevation fluctuation*® or by making an 
explicit model-based correction for changes in firn height”. Satellite 
gravimetry measures fluctuations in the Earth’s gravitational field 
computed using either global spherical harmonic solutions” or using 
spatially discrete mass concentration units”. Ice sheet mass changes 
are determined after making model-based corrections for glacial iso- 
static adjustment*’. The input-output method uses model estimates 
of SMB’ (the input) and satellite observations of ice sheet velocity 
computed from radar® and optical* imagery combined with airborne 
measurements of ice thickness® to compute changes in marine-ter- 
minating glacier discharge into the oceans (the output). The overall 
mass balance is the difference between the input and output. Not all 
annual surveys of ice sheet discharge are complete, and sometimes 
regional extrapolations have to be employed to account for gaps in 
coverage™. Because they provide important ancillary data, we also 
assess six models of glacial isostatic adjustment and ten models of 
surface mass balance. 
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To compare and aggregate the individual satellite datasets, we first 
adopt acommon approach to derive linear rates of ice sheet mass bal- 
ance over 36-month intervals (see Methods). We then compute error- 
weighted averages of all altimetry, gravimetry and input-output group 
mass trends, and combine these into a single reconciled estimate of 
the ice sheet mass balance using error-weighting of the group trends. 
Uncertainties in the individual rates of mass change are estimated as 
the root sum square of the linear model misfit and their measurement 
error, uncertainties in the group rates are estimated as the root mean 
square of the contributing time-series errors and uncertainties in the 
reconciled rates are estimated as their root mean square error divided 
by the square root of the number of independent groups. Cumulative 
uncertainties are computed as the root sum square of annual errors, 
an approach that has been employed in numerous studies”?! and 
assumes that annual errors are not correlated over time. To improve 
on this assumption, it is necessary to consider the covariance of the 
systematic and random errors present in each mass balance solution 
(see Methods). 


Intercomparison of satellite and model results 

The satellite gravimetry and satellite altimetry data used in our assess- 
ment are corrected for the effects of glacial isostatic adjustment, 
although the correction is relatively small for altimetry as it manifests as 
a change in elevation and not mass. The most prominent and consistent 
local signals of glacial isostatic adjustment among the six models we 
considered are two instances of uplift peaking at about 5-6 mm yr“, 
one centred over northwest Greenland and Ellesmere Island, and one 
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Fig. 2| Greenland Ice Sheet mass balance. Rate of mass change (dM/dt, where 
Mismass and ¢ttime) of the Greenland Ice Sheet determined from the satellite- 
altimetry, input-output method and gravimetry assessments included inthis 
study. In each case, dM/dtis computed at annual intervals from time series of 
relative mass change using a 3-yr window. An average of the estimates across 
each measurement technique is also shown for each year (black line). The 


over northeast Greenland (see Methods and Extended Data Fig. 3). 
Although some models identify a 2 mm yr“ subsidence under large 
parts of the central and southern parts of the ice sheet, it is absent 
or of lower magnitude in others, which suggests that it is less certain 
(Extended Data Table 1). The greatest difference among model solutions 
is at Kangerlussuaq Glacier in the southeast, wherea study” has shown 
that models and observations agree if a localized weak Earth structure 
associated with overpassing the Iceland hotspot is assumed; the effect 
is to offset earlier estimates of mass trends associated with glacial iso- 
static adjustment by about 20 Gtyr“. Farther afield, the highest spread 
between modelled uplift occurs on Baffin Island and beyond due to 
variations in regional model predictions related to the demise of the 
Laurentide Ice Sheet”. This regional uncertainty is probably a major 
factor in the spread across the ice-sheet-wide estimates. Neverthe- 
less, at -—3 +20 Gt yr‘, the mass signal associated with glacial isostatic 
adjustment in Greenland shows no coherent substantive change and 
is negligible relative to reported ice sheet mass trends!. 

There is generally good agreement between the models of Greenland 
Ice Sheet SMB that we have assessed for determining mass input— 
particularly those of a similar class; for example, 70% of all model 
estimates of runoff and accumulation fall within lo of their mean 
(see Methods and Extended Data Table 2). The exceptions area global 
reanalysis with coarse spatial resolution that tends to underestimate 
runoff due to its poor delineation of the ablation zone, and a snow 
process model that tends to underestimate precipitation and to over- 
estimate runoff in most sectors. Among the other eight models, the 
average surface mass balance between 1980 and 2012is 361+40Gtyr?, 
with a marked negative trend over time (Extended Data Fig. 4) that is 
mainly due to increased runoff’. At the regional scale, the largest dif- 
ferences occur in the northeast, where two regional climate models 
predict considerably less runoff, and in the southeast, where there 
is considerable spread in precipitation and runoff across all models. 
All models show high temporal variability in SMB components, and 
all models show that the southeast receives the highest net intake of 
mass at the surface due to high rates of snowfall originating from the 
Icelandic Low’. By contrast, the southwest, which features the wid- 
est ablation zone’, has experienced alternate periods of net surface 
mass loss and gain over recent decades, and has the lowest average 
SMB across the ice sheet. 

We assessed the consistency of the satellite altimetry, gravimetry and 
input-output method estimates of Greenland Ice Sheet mass balance 
using common spatial and temporal domains (see Fig. 2and Methods). 
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estimated lo, 20and 3oranges of the class average are shaded in dark, mid and 
light grey, respectively; 97% of all estimates fall within the lorange, given their 
estimated individual errors. The equivalent sea-level contribution of the mass 
change isalso indicated (right vertical axis), and the number of individual mass- 
balance estimates collated at each epochis shown below each bar. 


In general, there is close agreement between estimates determined 
using each approach, and the standard deviations of annual mass bal- 
ance solutions from the coincident altimetry, gravimetry and input- 
output methods are 42, 31 and 23 Gt yr“, respectively (Extended Data 
Table 3). Once averages were computed for each technique, the result- 
ing estimates of mass balance were also closely aligned (Extended Data 
Fig. 6). For example, over the common period 2005-2015, the average 
Greenland Ice Sheet mass balance is -254 +18 Gt yr‘ and, by compari- 
son, thespread of the altimetry, gravimetry and input-output method 
estimates is just 36 Gt yr‘ (Extended Data Table 3). The estimated uncer- 
tainty ofthe aggregated mass balance solution (see Methods) is larger 
than the standard deviation of model corrections for glacial isostatic 
adjustment (20 Gt yr‘ for gravimetry) and for surface mass balance 
(40 Gt yr’), which suggests that their collective impacts have been 
adequately compensated; itis also larger than the estimated 30 Gt yr 
mass losses from peripheral ice caps“, which are not accounted for in 
allindividual solutions. In keeping with results from Antarctica“, rates 
of mass loss determined using the input-output method are the most 
negative, and those determined from altimetry are the least negative. 
However, the spread among the three techniques is six times lower for 
Greenland than it is for Antarctica”, reflecting differences in ice sheet 
size, the complexity of the mass balance processes and the limitations 
of the various geodetic techniques. 


Ice sheet mass balance 


We aggregated the average mass balance estimates from gravimetry, 
altimetry and the input-output method to forma single, time-varying 
record (Fig. 2) and then integrated these data to determine the 
cumulative mass lost from Greenland since 1992 (Fig. 3). Although 
Greenland has been losing ice throughout most of the intervening 
period, the rate of loss has varied considerably. The rate of ice loss 
progressively increased between 1992 and 2012, reaching a maximum 
of 345 + 66 Gt yr‘ in 2011, ahead of the extreme summertime surface 
melting that occurred in the following year". Since 2012, however, 
the trend has reversed, with a progressive reduction in the rate of 
mass loss during the subsequent period. By 2018—the last complete 
year of our survey—the annual rate of ice mass loss had reduced to 
85+ 75 Gtyr”. The highly variable nature of ice losses from Green- 
land is aconsequence of the wide range of physical processes that 
are affecting different sectors of the ice sheet!?>5, which suggests 
that care should be taken when extrapolating measurements that 
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Fig. 3| Cumulative anomalies inthe total mass, SMB and ice dynamics of the 
Greenland Ice Sheet. The total change is determinedas the integral of the 
average rate of ice sheet mass change (Fig. 2). The change in SMB is determined 
from three regional climate models relative to their mean over the period 
1980-1990. The change associated with ice dynamics is determined as the 
difference between the change in total and surface mass. The estimated lo 


are sparse in space or time. Although the rates of mass loss we have 
computed between 1992 and 2011 are 16% less negative than those of 
a previous assessment, which included far fewer datasets’, the results 
are consistent given their respective uncertainties. Altogether, the 
Greenland Ice Sheet has lost 3,902 + 342 Gt of ice to the ocean since 
1992, with roughly half of this loss occurring during the 6-yr period 
between 2006 and 2012. 

To determine the proportion of mass lost due to surface and ice 
dynamical processes, we computed the contemporaneous trend in 
Greenland Ice Sheet surface mass balance—the net balance between 
precipitation and ablation’, which is controlled by interactions with 
the atmosphere (Fig. 3). In Greenland, recent trends in surface mass 
balance have been largely driven by meltwater runoff, which has 
increased as the regional climate has warmed”. As direct observations 
of ice sheet surface mass balance are too scarce to provide full temporal 
and spatial coverage*, regional estimates are usually taken from atmos- 
pheric models that are evaluated with existing observations. Our evalu- 
ation (see Methods) shows that the finer-spatial-resolution regional 
climate models produce consistent results, probably due to their abil- 
ity to capture local changes in melting and precipitation associated 
with atmospheric forcing, and to resolve the full extent of the ablation 
zone*°. We therefore compare and combine estimates of Greenland 
SMB derived from three regional climate models: RACMO2.3p2*, 
MARv3.67 and HIRHAM”. To assess the surface mass change across 
the Greenland Ice Sheet between 1980 and 2018, we accumulate SMB 
anomalies from each of the regional climate models (Extended Data 
Fig. 7) and average them into a single estimate (Fig. 3). These SMB 
anomalies are computed with respect to the average between 1980 and 
1990, which corresponds to a period of approximate balance® and is 


uncertainties of the cumulative changes are shown by the shaded envelopes. 
The dotted line shows the result of a previous assessment’. The equivalent sea- 
level contribution of the mass changeis also indicated (right vertical axis). 
Vertical dashed lines mark consecutive 5-yr epochs since the start of our 
satellite record in1992. The IMBIE 2012 datais from ref. '. 


common toall models. Inthis comparison, all three models show that 
the Greenland Ice Sheet entered abruptly into a period of anomalously 
low SMB in the late 1990s and, when combined, they show that the ice 
sheet lost 1,964 + 565 Gt of its mass due to meteorological processes 
between 1992 and 2018 (Table 1). 

Just over half (50.3%) ofall mass losses from Greenland—and much 
of their short-term variability—have been due to variations in the ice 
sheet’s SMB and its indirect impacts on firn processes. For example, 
between 2007 and 2012, 70% of the total ice loss (193 + 37 Gt yr“) was 
due to SMB, compared with 27% (22 + 20 Gt yr“) over the preceding 
15 years and 57% (139 + 38 Gt yr”) since then (Table 1). The rise in the 
total rate of ice loss during the late 2000s coincided with warmer 
atmospheric conditions, which promoted several episodes of wide- 
spread melting and runoff". The reduction in surface mass loss since 
thenis associated with a shift of the North Atlantic Oscillation, which 
brought about cooler atmospheric conditions and increased precipita- 
tion along the southeastern coast”. Trends in the total ice sheet mass 
balance are not entirely due to surface mass balance, however, and by 
calculating the difference between these two signals, we can estimate 
the total change in mass loss due to ice dynamical imbalance—thatis, 
the integrated net mass loss from those glaciers whose velocity does 
not equal their long-term mean (Fig. 3). Although this approach is indi- 
rect, it makes use of all the satellite observations and regional climate 
models included in our study, overcoming limitations inthe spatial and 
temporal sampling of ice discharge estimates derived from ice velocity 
and thickness data. Our estimate shows that, between 1992 and 2018, 
Greenland lost 1,938 + 541 Gt of ice due to the dynamical imbalance 
of glaciers relative to their steady state, accounting for 49.7% of the 
total imbalance (Table 1). Losses due to increased ice discharge rose 


Table 1 | Rates of Greenland Ice Sheet total, surface and dynamical mass change 


Region 1992-1997 (Gtyr") 1997-2002 2002-2007 2007-2012 2012-2017 (Gtyr") 1992-2011(Gtyr’) 1992-2018 
(Gtyr’) (Gtyr") (Gtyr") (Gtyr’) 
Total -26 +27 -44+35 -174 +30 -275 +28 -244 + 28 -119 +16 -150 +13 
Surface 26 +35 -15 +36 -78 +36 -193 +37 -139 +38 -57+18 -16 +16 
Dynamics -52+44 -29 +50 -96+47 -82+46 -105 + 47 -62424 -75 +21 


Total rates were determined from all satellite measurements over the various epochs, rates of surface mass change were determined from three regional climate models and rates of dynamical 


mass change were determined as the difference between the two. The period 1992-2011 is included for comparison with a previous assessment’, which reported a mass balance estimate of 


-142 + 49 Gt yr" based on far fewer data. The small differences in our updated estimate are due to our inclusion of more data and an updated aggregation scheme (see Methods). Errors are 10. 
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Fig. 4| Observed and predicted sea-level contributions from Greenland Ice 
Sheet mass change. The global sea-level contribution from Greenland Ice 
Sheet mass change according to this study and the IPCC ARS projections 
between 1992-2040 (left) and 2040-2100 (right) including upper, mid and 
lower estimates from the sum of modelled SMB and rapid ice dynamical 
contributions. Darker lines represent pathways from the five ARS scenariosin 


sharply in the early 2000s when Jakobshavn Isbre”’ and several other 
outlet glaciers in the southeast” sped up, and the discharge losses 
are now four times higher than in the 1990s. For the period between 
2002 and 2007, ice dynamical imbalance was the major source of ice 
loss from the ice sheet as a whole, although the situation has since 
returned to being dominated by surface mass losses as several glaciers 
have slowed down”. 

Despite a reduction in the overall rate of ice loss from Greenland 
between 2013 and 2018 (Fig. 2), the ice sheet mass balance remained 
negative, adding 10.8+ 0.9 mm to global sea level since 1992. Although 
the average sea level contribution is 0.42 + 0.04 mm yr“, the 5-yr 
average rate varied by a factor of 5 over the 25-yr period, peaking at 
0.76 + 0.08 mm yr‘ between 2007 and 2012. The variability in ice loss 
from Greenland illustrates the importance of accounting for annual 
fluctuations when attempting to close the global sea-level budget?. 
Satellite records of ice sheet mass balance are also an important tool 
for evaluating numerical models of ice sheet evolution*®. In their 2013 
assessment, the Intergovernmental Panel on Climate Change (IPCC) 
predicted ice losses from Greenland due to SMB and glacier dynam- 
ics under a range of scenarios, beginning in 2007” (Fig. 4). Although 
ice losses from Greenland have fluctuated considerably during the 
12-yr period of overlap between the IPCC predictions and our recon- 
ciled time series, the total change and average rate (0.70 mmyr") are 
close to the upper range of predictions (0.72 mm yr ), which implies 
70-130 mm of sea-level rise by 2100 above central estimates. The drop 
in ice losses between 2013 and 2018, however, shifted rates towards 
the lower end of projections, and a longer period of comparison is 
required to establish whether the upper trajectory will continue to 
be followed. Even greater sea-level contributions cannot be ruled out 
if feedbacks between the ice sheet and other elements of the climate 
system are underestimated by current ice sheet models’. Although the 
volume of ice stored in Greenland is a small fraction of that in Antarc- 
tica (12%), its recent losses have been ~38% higher“! as a consequence 
of the relatively strong atmospheric’ and oceanic’°™ warming that 
has occurred inits vicinity, and it is expected to continue to bea major 
source of sea-level rise”. 


1995 2000 2005 2010 2015 2020 2025 2030 2035 2040 


2040 2100 


Year 


order of increasing emissions: RCP2.6, RCP4.5, RCP6.0, SRES A1B and RCP8.5. 
Shaded areas represent the spread of ARS emissions scenarios andthelo 
estimated error onthe IMBIE data (this study). Inset, the average annual rates of 
sea-level rise during the overlap period 2007-2018 and their standard 
deviations (error bars). Cumulative ARS projections have been offset to make 
them equal tothe observational record at their start date (2007). 


Conclusions 


We combine 26 satellite estimates of ice sheet mass balance and assess 
10 models of ice sheet SMB and 6 models of glacial isostatic adjustment 
to show that the Greenland Ice Sheet lost 3,902 + 342 Gt of ice between 
1992 and 2018. Duringthe common period 2005-2015, the spread of mass 
balance estimates derived from three techniquesis36Gtyr“, or 14% ofthe 
estimated rate of imbalance. The rate of ice loss has generally increased 
over time, rising from 26 + 27 Gt yr between 1992 and 1997, peaking at 
275 +28 Gt yr between 2007 and 2012, and reducing to 244 + 28Gt yr* 
between 2012 and 2017. Just over half (1,964 + 565 Gt, or 50.3%) of theice 
losses are due to reduced SMB (mostly meltwater runoff) associated with 
changing atmospheric conditions”, and these changes have also driven 
theshorter-term temporal variability inice sheet mass balance. Despite 
variations in the imbalance of individual glaciers*>”’, ice losses due to 
increasing discharge from the ice sheet as a whole have risen steadily 
from46+37 Gtyr ‘inthe 1990s to 87+ 25 Gt yr ‘since then, and account 
for just under half ofall losses (49.7%) over the survey period. 

Our assessment shows that estimates of Greenland Ice Sheet mass 
balance derived from satellite altimetry, gravimetry and the input- 
output method agree to within 20 Gt yr‘, that model estimates of SMB 
agree to within 40 Gt yr‘ and that model estimates of glacial isostatic 
adjustment agree to within 20 Gt yr“. These differences represent a 
small fraction (13%) of the Greenland Ice Sheet mass imbalance and are 
comparable to its estimated uncertainty (13 Gtyr“). Nevertheless, there 
is still disagreement among models of glacial isostatic adjustment in 
northern Greenland. Spatial resolution is a key factor in the degree to 
which models of SMB can represent ablation and precipitation at local 
scales, andestimates of ice sheet mass balance determined from satel- 
lite altimetry and the input-output method continueto be positively 
and negatively biased, respectively, compared with those based on 
satellite gravimetry (albeit by small amounts). More satellite estimates 
of ice sheet mass balance at the start (1990s) and end (2010s) of our 
record would help to reduce the dependence on fewer data during 
those periods; although new missions*”~ will no doubt address the 
latter period, further analysis of historical satellite data are required 
to address the paucity of data during the early years. 
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Methods 


Data 

In this assessment, we analyse five groups of data: estimates of ice 
sheet mass-balance determined from three distinct classes of satellite 
observations (altimetry, gravimetry and the input-output method 
(IOM)) and model estimates of SMB) and glacial isostatic adjustment 
(GIA). Each dataset is computed following previously reported methods 
(based onrefs. 385!" and detailed in Supplementary Table 1) and, for 
consistency, they are aggregated withincommon spatial and temporal 
domains. Altogether, 26 separate ice sheet mass balance datasets were 
used—9 derived from satellite altimetry, 3 from the IOM and 14 from 
satellite gravimetry—with a combined period running from 1992 to 
2018 (Extended Data Fig. 1). We also assess six model estimates of GIA 
(Extended Data Table 1) and ten model estimates of SMB (Extended 
Data Table 2). 


Drainage basins 

We analyse mass trends using two ice sheet drainage basin sets 
(Extended Data Fig. 2) for consistency with those used in the first 
IMBIE assessment! and to evaluate an updated definition tailored 
towards mass budget assessments. The first set comprises 19 drainage 
basins delineated using surface elevation maps derived from ICESat-1 
with a total area of 1,703,625 km? (ref. 2°). The second drainage basin 
set is an updated definition that considers other factors such as the 
direction of ice flow and includes 6 basins with a combined area of 
1,723,300 km? (ref. *”). The two drainage basin sets differ by 1% in area 
at the scale of the Greenland Ice Sheet, and this has a negligible impact 
on mass trends when compared to the estimated uncertainty of indi- 
vidual techniques. 


Glacial isostatic adjustment 

GIA (the delayed response of Earth’s interior to temporal changes in 
ice loading) affects estimates of ice sheet mass balance determined 
from satellite gravimetry and, toa lesser extent, satellite altimetry”. 
Here, we compare six independent models of GIA in the vicinity of the 
Greenland Ice Sheet (Extended Data Table 1). The GIA model solutions 
we considered differ for a variety of reasons, including differences in 
their physics, in their computational approach, in their prescriptions 
of solid Earth unloading during the last glacial cycle and their Earth rhe- 
ology, and in the datasets against which they are evaluated. Although 
alternative ice histories (for example, ref. *) and mantle viscosities (for 
example, ref. *) are available, we restricted our comparison to those 
that contributed to our assessment. No approach is generally accepted 
as optimal, and so we evaluate the models by computing the mean 
and standard deviation of their predicted uplift rates (Extended Data 
Fig. 3). We also estimate the contribution of each model to gravimetric 
mass trends using acommon processing approach" that puts special 
emphasis on the treatment of low spherical harmonic degrees in the 
GIA-related trends in the gravitational field. 

The highest rates of GIA-related uplift occur in northern Greenland, 
although this region also exhibits marked variability among the solu- 
tions, as does the area around Kangerlussuaq Glacier to the southeast. 
Eventhough the model spread is high innorthern Greenland, the signal 
inthis sector is also consistently high in most solutions. However, none 
of the GIA models considered here fully captures all areas of high uplift 
present in the models, and so it is possible there is a bias towards low 
values in the average field across the ice sheet overall. The models 
yield an average adjustment for GRACE estimates of the Greenland Ice 
Sheet mass balance of —3 Gt yr“, with a standard deviation of around 
20Gtyr™. The spread is probably due, in part, to differences in the way 
each model accounts for GIA in North America (which is ongoing and 
impacts western Greenland), and so care must betaken when estimat- 
ing mass balance at the basin scale. Local misrepresentation of the 
solid Earth response canalso havea relatively large impact stemming 


especially from lateral variations of solid Earth properties”, and revi- 
sions of the current state of knowledge can be expected™. 


SMB 

Here, ice-sheet SMB is defined as total precipitation minus sublima- 
tion, evaporation and meltwater runoff; that is, the interaction of 
the atmosphere and the superficial snow and firn layers, for example 
through mass exchanges via precipitation, sublimation and runoff, 
and through mass redistribution by snowdrift, melting and refreezing. 
We compareten estimates of Greenland Ice Sheet SMB derived using a 
range of alternative approaches; four regional climate models (RCMs), 
two downscaled RCMs, a global reanalysis, two downscaled model 
reanalyses of climate data and one gridded model of snow processes 
driven by climate model output (Extended Data Table 2). 

Although SMB models of similar classes tend to produce similar 
results, there are larger differences between classes—most notably 
the global reanalysis and the process model—which lead to estimates 
of SMB that are substantially higher and lower than all other solutions, 
respectively. The regional climate model solutions agree well at the 
scale of individual drainage sectors, with the largest differences occur- 
ring innortheast Greenland (Extended Data Fig. 4). The snow process 
model tends to underestimate SMB when compared with the other 
solutions that we have considered in various sectors of the ice sheet, 
at times even yielding negative SMB, while the global reanalysis tends 
to overestimate it. 

Across all models, the average SMB of the Greenland Ice Sheet 
between 1980 and 2012 is 351 Gt yr‘ and the standard deviation is 
98 Gt yr. However, the spread among the 8 RCM and downscaled 
reanalyses is considerably smaller; these solutions lead to an average 
Greenland Ice Sheet SMB of 361 Gt yr! with a standard deviation of 
40 Gt yr’ over the same period. By comparison, the global reanalysis 
and process model lead to ice-sheet-wide estimates of SMB that are 
considerably larger (504 Gt yr) and smaller (125 Gt yr“) than this range, 
respectively. Model resolution is an important factor when estimating 
SMB and its components, as respective contributions where only the 
spatial resolution differed yield regional differences. The underlying 
model domains were also identified as a source of discrepancy inthe 
case of the Greenland Ice Sheet, as some products would allocate the 
ablation area outside the given mask. 


Individual estimates of ice sheet mass balance 

To standardize our comparison and aggregation of the 26 individual 
satellite estimates of Greenland Ice Sheet mass balance, we applied a 
common approach to derive rates of mass change from cumulative 
mass trends”. Rates of mass change were computed over 36-month 
intervals centred on regularly spaced (monthly) epochs within each 
cumulative mass trend time series, oversampling the individual time 
series where necessary. At each epoch, rates of mass change were esti- 
mated by fitting a linear trend to data within the surrounding 36-month 
time window using a weighted least-squares approach, with each point 
weighted by its measurementerror. The associated mass trend uncer- 
tainties were estimated as the root sum square of the regression error 
and the measurement error. Time series were truncated by half the 
moving-average window period at the start and end of their period. 
The emerging rates of mass change were then averaged over calendar 
years to reduce the impact of seasonal cycles. 


Gravimetry. We include 14 estimates of Greenland Ice Sheet ice sheet 
mass balance determined from GRACE satellite gravimetry that to- 
gether span the period 2003- 2016 (Extended Data Fig. 1). Ten of the 
gravimetry solutions were computed using spherical harmonic solu- 
tionsto the global gravity field and four were computed using spatially 
defined mass concentration units (Supplementary Table 1). An unre- 
stricted range of alternative GIA corrections were used inthe formation 
of the gravimetry mass balance solutions based on commonly adopted 


model solutions and their variants***! °’ (Supplementary Table 1). All 
of the gravimetry mass balance solutions included in this study use the 
same degree-1 coefficients to account for geocentre motion® and, al- 
though analternative set is now available”, the estimated improvement 
in certainty is small in comparison with their magnitude and spread. 
There was some variation in the sampling of the individual gravim- 
etry datasets, and their collective effective (weighted mean) temporal 
resolution is 0.08 yr. Overall, there is good agreement between rates 
of Greenland Ice Sheet mass change derived from satellite gravimetry 
(Extended Data Fig. 5); all solutions showthe ice sheet to beina state of 
negative mass balance throughout their survey periods, with mass loss 
peakingin 2011 and reducing thereafter. During the period 2005-2015, 
annual rates of mass change determined from satellite gravimetry 
differ by 104 Gt yr“ on average, and their average standard deviation 
is 31 Gt yr (Extended Data Table 3). 


Altimetry. We include nine estimates of Greenland Ice Sheet mass bal- 
ance determined from satellite altimetry that together span the period 
2004-2018 (Extended Data Fig. 1). Three of the solutions are derived 
from radar altimetry, four from laser altimetry and two use a combina- 
tion of both (Supplementary Table 1). The altimetry mass trends arealso 
computed using a range of approaches, including crossovers, planar 
fits and repeat track analyses. The laser altimetry mass trends arecom- 
puted from ICESat-1 data as constant rates of mass change over their 
respective survey periods, whereas the radar altimetry mass trends 
are computed from EnviSat and/or CryoSat-2 data with a temporal 
resolution of between land 72 months. In consequence, the altimetry 
solutions have an effective collective temporal resolution of 0.74 yr. 
Mass changes are computed after making corrections for alternative 
sources of surface elevation change, including glacial isostatic and 
elastic adjustment, and firn height changes (see Supplementary Ta- 
ble1). Despite the range of input data and technical approaches, there 
is good overall agreement between rates of mass change determined 
from the various satellite altimetry solutions (Extended Data Fig. 5). 
Allaltimetry solutions show the Greenland Ice Sheet to be ina state of 
negative mass balance throughout their survey periods, with mass loss 
peakingin 2012 and reducing thereafter. During the period 2005-2015, 
annual rates of mass change determined from satellite altimetry dif- 
fer by 121 Gt yr‘ on average, and their average standard deviation is 
42 Gt yr (Extended Data Table 3). The greatest variance lies among 
the 4 laser altimetry mass balance solutions, which range from —248 
to -128 Gt yr‘ between 2004 and 2010; aside from methodological 
differences; possible explanations for this high spread include the 
relatively short period over which the mass trends are determined, 
the poor temporal resolution of these datasets and the rapid change 
in mass balance occurring during the period in question. 


IOM. We include three estimates of Greenland Ice Sheet mass balance 
determined from the IOM that together span the period 1992-2015 
(Extended Data Fig. 1). Although there are relatively few datasets in 
comparison with the gravimetry and altimetry solutions, the input- 
output data provide information on the partitioning of the mass change 
(surface processes and/or ice dynamics) cover a considerably longer 
period and are therefore an important record of changes in Greenland 
Ice Sheet mass during the 1990s. The IOM makes use of a wide range of 
satellite imagery (for example, refs. °*°*” 1°) combined with measure- 
ments of ice thickness (for example, ref. !) for computing ice sheet 
discharge (output), and several alternative SMB model estimates of 
snow accumulation (input) and runoff (output) (see Supplementary 
Table 1). Two of the IOM datasets exhibit temporal variability across 
their survey periods, and ywo provide only constant rates of mass 
changes. Although these latter records are relatively short, they are 
an important marker with which variances among independent es- 
timates can be evaluated. The collective effective (weighted mean) 
temporal resolution of the IOM dataare 0.14 yr, although it should be 


noted thatin earlier years the satellite ice discharge component of the 
data are relatively sparsely sampled in time (for example, ref. !*).There 
is good overall agreement between rates of mass change determined 
from the input-output method solutions (Extended Data Fig. 5). During 
the period 2005-2015, annual rates of mass change determined from 
the four input-output datasets differ by up to 48 Gt yr‘ on average, 
and their average standard deviation is 23 Gt yr“ (Extended Data Ta- 
ble3). These differences are comparable to the estimated uncertainty 
of the individual techniques and arealso small relative to the estimated 
mass balance over the period in question. In addition to showing that 
the Greenland Ice Sheet was in a state of negative mass balance since 
2000, with mass loss peaking in 2012 and reducing thereafter, the IOM 
data show that the ice sheet was close to a state of balance before this 
period”. 


Aggregate estimate of ice sheet mass balance 
To produce an aggregate estimate of Greenland Ice Sheet mass bal- 
ance, we combine the 14 gravimetry, 9 altimetry and 3IOM datasets to 
producea single 26-yr record spanning the period 1992-2018. First, we 
combine the gravimetry, altimetry and the IOM data separately into 
three monthly time series by forming an error-weighted average of 
individual monthly rates of ice sheet mass change computed using the 
same technique (Extended Data Fig. 6). At each epoch, we estimate the 
uncertainty of these time-series as the root mean square of their compo- 
nent time-series errors. Wethen combine the mass balance time series 
derived from gravimetry, altimetry and the IOM to produce a single 
aggregate (reconciled) estimate, computed as the error-weighted mean 
of mass trends sampled at each epoch. We estimated the uncertainty 
of this reconciled rate of mass balance as either the root mean square 
departure of the constituent mass trends from their weighted-mean or 
theroot mean square of their uncertainties, whichever is larger. Cumu- 
lative uncertainties are computed as the root sum square of annual 
errors, on the assumption that annual errors are not correlated over 
time. This assumption has been employed in numerous mass balance 
studies!”31, and its effect is to reduce cumulative errors by a factor 
of 2.2 over the 5-yr periods we employ in this study (Table 1). If some 
sources of error are temporally correlated, the cumulative uncertainty 
may therefore be underestimated. Ina recent study, for example, it is 
estimated that 30% of the annual mass balance error is systematic’, 
and in this instance the cumulative error may be 37% larger. On the other 
hand, the estimated annual error on aggregate mass trends reportedin 
this study (61 Gt yr“) are 70% larger than the spread of the independent 
estimates from which they are combined (36 Gt yr“) (Extended Data 
Table 3), which suggests the underlying errors may be overestimated 
by asimilar degree. A more detailed analysis of the measurement and 
systematic errors is required to improve the cumulative error budget. 
During the period 2004-2015, when all three satellite techniques 
were in operation, there is good agreement between changes in ice 
sheet mass balance on a variety of timescales (Extended Data Fig. 6). 
In Greenland, there are large annual cycles in mass superimposed on 
equally prominent interannual fluctuations as well as variations of inter- 
mediate (-5 yr) duration. These signals are consistent with fluctuations 
in SMB that have been identified in meteorological records!”, and are 
present within the time series of mass balance emerging from all three 
satellite techniques, to varying degrees, according to their effective 
temporal resolution. For example, correlated seasonal cycles are appar- 
ent inthe gravimetry and IOM mass balance time series, because their 
effective temporal resolutions are sufficiently short (0.08 and 0.14 yr, 
respectively) to resolve such changes. However, at 0.74 yr, the effective 
temporal resolution of the altimetry mass balance time series is too 
coarse to detect cycles on sub-annual timescales. Nevertheless, when 
the aggregated mass balance data emerging fromall three experiment 
groups are degraded toa common temporal resolution of 36 months, 
the time series are well correlated (0.63 <r? < 0.80) and, over longer 
periods, all techniques identify the marked increases in Greenland 
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Ice Sheet mass loss peaking in 2012. During the period 2005-2015, 
annual rates of mass change determined from all three techniques 
differ by up 162 Gt yr‘ on average, and their average standard deviation 
is 41 Gt yr ‘—a value that is small when compared with their estimated 
uncertainty (18 Gt yr‘) (Extended Data Table 3). 


Data availability 

The aggregated Greenland Ice Sheet mass balance dataand estimated 
errors generated inthis study are freely available at http://imbie.org 
and atthe NERC Polar Data Centre, https://doi.org/10.5285/8D5FF221- 
A470-4CC1-B7C4-CBDF383554FC. 


Code availability 


The code usedto compute and aggregate rates of ice sheet mass change 
and their estimated errors are freely available at https://github.com/ 
IMBIE. 
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Extended Data Fig. 1|Ice sheet mass balance datasets. a, Participant datasets 
used inthis study and their main contributors. b, The number of data available 
ineach calendar year. The interval 2003-2010 includes almost all datasets and 
is selected as the overlap period. Further details of the satellite observations 
used in this study are provided in Supplementary Table 1. Refs, 7833-385¢ 5°71, 82-90, 


Extended Data Fig. 2 | Greenland Ice Sheet drainage basins. a, b, Basin used in this study, according to the definitions of ref.”° (a) and ref. * (b). 
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Extended Data Fig. 3 | Modelled glacialisostatic adjustment inGreenland.a, | models used inthis study are providedin Extended Data Table 1. High rates of 
b, Bedrock uplift rates in Greenland averaged over the GIA model solutions uplift and subsidence associated with the former Laurentide Ice Sheet are 
used in this study (a) and their standard deviation (b).FurtherdetailsoftheGIA | apparenttothesouthwest of Greenland. 
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Extended Data Fig. 4 | SMB of the Greenland Ice Sheet. a-f, Time series of SMB 
inthe NW, CW, SW, SE, NE and NO Greenland Ice Sheet drainage basins 
(Extended Data Fig. 2)'°*"°. Solid lines are annual averages of the monthly data 
(dashed lines). Further details of the SMB models used in this study are 
provided in Extended Data Table 2. 
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Extended Data Fig. 5 | Greenland Ice Sheet mass balance intracomparison. 
a-c, Individual rates of Greenland Ice Sheet mass balance used in this study as 
determined from satellite altimetry (a), gravimetry (b) and the input-output 
method (c). The grey shading shows the estimated 1o (dark), 20 (mid-) and 30 
(light) uncertainty relative to the ensemble average. Refs. 23785659-71.82-90, 
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Rate of Greenland Ice Sheet mass balance as derived fromthe threetechniques: uncertainty is also shown (shaded envelopes) and is computed as the root 
satellite radar and laser altimetry (red), input-output method (blue) and mean square of the component time-series errors. 
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Extended Data Fig. 7 | Cumulative Greenland Ice Sheet SMB. The cumulative 
surface mass change determined from an average (mean) of the 
RACMO2.3p2**, MARV3.67 and HIRHAM? regional climate models relative to 
their 1980-1990 means (see Methods). The estimated uncertainty of the mean 
change is also shown (shaded area), computed as the average of the 
uncertainties from each of the three models. RACMO2.3p2 uncertainties are 


based onacomparison to in situ observations**. MARV3.6 uncertainties are 
evaluated from the variability due to forcing from climate reanalyses”. 
HIRHAM uncertainties are estimated onthe basis of comparisons toin situ 
accumulation and ablation data”. Cumulative uncertainties are computed as 
the root sum square of annual errors, onthe assumption that these errors are 
not correlated over time”. 


Extended Data Table 1| Details of GIA models used in this study 


tRegional changes in mass associated with the GIA signal determined by the contributor. 


+Regional changes in mass associated with the GIA signal calculated as an indicative rate using spherical-harmonic degrees 3 to 90 and a common treatment of degree 2 (ref. '°°). 


*Main reference publication(s). 


’Model from main publication unless otherwise stated. Comma-separated values refer to properties of a radially varying (1D, one-dimensional) Earth model: the first value is lithosphere thick- 


ness (km), other values reflect mantle viscosity (x 10” Pa s) for specific layers; see relevant publication. 


°GIA model details: SH = spherical harmonic (maximum degree indicated), FE = finite element, C = compressible, IC = incompressible, RF = rotational feedback, SG = self-gravitation, OL = ocean 


loading, ‘x’ = feature not included. 


‘RSL = relative sea-level data; GPS rates corrected for elastic response to contemporary ice mass change. 


°Earth model taken from ref. *'. 
‘lce model taken from ref. *'. 


8Different to ICE-6G_C in Antarctica, owing to the use of Bedmap2"” topography. 


Ice Constraint GIA 
; mare (Gt/yr) 
Contributor Model Publication # model » data4 
A A13 Aetal., 2013 VM5a (1D) & ICE-6G_Cf SH, C, RF, As for -9t 
SG, OL ICE-6G_Cf 
Lecavalier Huy3 Lecavalier et al., 2014 1D (120, 0.5, Huy3/ICE-5G | SH(256), RSL, ice extent, -19# 
2) IC, RF, SG, paleo thinning 
OL rates 
Sasgen GGG1D.0 Khan et al., 2016 VM-GPS 42 modified SH(256)/F GPS, RSL +17" 
79 j 
Martinec, 2000 GREEN1 E(radial), 
IC, RF, SG, 
OL 
Peltier ICE-6G_D | Peltier et al., 2015 VMB5a (1D) & ICE-6G_D@ | SH(512) | GPS, RSL, Earth -10? 
(VM5a) rotation 
van der Wal SL-dry- King et al., 2016 3D, power- Combination GPS, RSL, +21* 
4mm/W law rheology of W12 seismic 
12 (Antarctica) velocities (Earth 
and ICE-5G model) 
Spada SELEN 4 Spada et al., 2018 VM5a (3-layer ICE-6G_Cf SELEN4: As for -27* 
average of 1D SH(128), ICE-6G_Cf 
model) & IC, RF, SG, 
OL 
Refs. 34,42,51,55,111,113,114 | 
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Extended Data Table 2 | Details of the SMB models used in this study 


Contributor Loupe | see Publication * 


2015 
oe oe 
2018 
Langen HIRHAM5S Lucas-Picher 


et al., 2012 


Area 


(10° km2) 
1.73 


Grid (Gt/yr) (Gt/yr) (Gt/yr) 


11 | ee eed 350 721 311 


Fettweis MARv3.6 Fettweis et 
al., 2017 


Noél RACMO2.3d Noél et al., 
2016 
2018 

Cullather MERRA-2 Gelaro et al., GA-n 


2017 


Hanna Hanna et al., 
2012 


2017 
2010 


Refs, 91321.46.115-119 


®Main reference publication; additional references are provided in Supplementary Table 1. 


®SMB model class; regional climate model (RCM), global numerical analysis (GA), process model (PM). Native resolution (n) and downscaled (d) models are also identified. 
Averages over the period 1980-2012 for the Greenland Ice Sheet excluding peripheral ice caps and using the drainage basins from ref. °”. 


Extended Data Table 3 | Rate of Greenland Ice Sheet mass change for 2005-2015 


Technique Mass balance (Gt/yr) s.d.(Gt/yr) range (Gt/yr) 


Altimetry* -244+15 43 122 
Gravimetry -248 + 18 31 104 


Input-Output Method -281 +425 
All -255 + 20 


Estimates of ice-sheet mass balance from satellite altimetry, gravimetry the input-output method and from all three groups during the period 2005-2015. The average s.d. and ranges of indi- 
vidual estimates within each group during the same period are also shown. *No altimetry data in 2010. 
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The accretion of volatile-rich material from the outer Solar System represents a 
crucial prerequisite for Earth to develop oceans and become a habitable planet’. 


However, the timing of this accretion remains controversial* ®. It has been proposed 
that volatile elements were added to Earth by the late accretion of alate veneer 
consisting of carbonaceous-chondrite-like material after core formation had 
ceased®*"°. This view could not be reconciled with the ruthenium (Ru) isotope 


composition of carbonaceous chondrites 


511 which is distinct from that of the modern 


mantle”, or of any known meteorite group’. As a possible solution, Earth’s pre-late- 
veneer mantle could already have contained a fraction of Ru that was not fully 
extracted by core formation”. The presence of such pre-late-veneer Ru can only be 
established if its isotope composition is distinct from that of the modern mantle. Here 
we report the first high-precision, mass-independent Ru isotope compositions for 
Eoarchaean ultramafic rocks from southwest Greenland, which display a relative 

?°Ru excess of 22 parts per million compared with the modern mantle value. This 
?0°Ru excess indicates that the source of the Eoarchaean rocks already contained a 
substantial fraction of Ru before the accretion of the late veneer. By 3.7 billion years 


ago, the mantle beneath southwest Greenland had not yet fully equilibrated with late 
accreted material. Otherwise, no Ru isotopic difference relative to the modern mantle 
would be observed. If constraints from other highly siderophile elements besides Ru 
are also considered”, the composition of the modern mantle can only be reconciled if 


the late veneer contained substantial amounts of carbonaceous-chondrite-like 
materials with their characteristic °°Ru deficits. These data therefore relax previous 
constraints on the late veneer and are consistent with volatile-rich material fromthe 
outer Solar System being delivered to Earth during late accretion. 


Ruthenium is a highly siderophile element (HSE) and is therefore 
expected to be sequestered in the metallic core during Earth’s differ- 
entiation. Contrary tothis prediction, the abundances of Ru and other 
HSEs in the modern mantle are higher than expected compared with 
metal-silicate equilibrium conditions’. This observation is most 
commonly explained by HSE replenishment of the mantle through the 
addition ofa late veneer after core formation. Relative abundances of 
HSEs that are close to chondritic compositions in the mantle suggest 
that the late veneer must have consisted of primitive meteoritic mate- 
rial’”"®, amounting to -0.5% of Earth’s mass®. The chemical composition 
of the late veneer and its origin are a longstanding matter of debate, 
especially inthe context of how and when Earth accreted its water and 


volatiles®©*!°, Previous studies debated whether significant amounts 
of volatile-rich carbonaceous-chondrite-like material were added by 
the late veneer during the final stages of Earth’s accretion®*”” or had 
already been incorporated during earlier stages of Earth’s growth?” 

Mass-independent ruthenium isotopic variations among meteorites 
and Earth have provided evidence that the late veneer was derived 
from reduced and volatile-poor inner Solar System materials most 
similar to enstatite chondrites*"”. This is in contrast to constraints 
from the relative abundances of volatile elements such as selenium 
(Se), tellurium (Te) and sulfur (S) and the Se isotope composition in 
the silicate Earth that were used to argue for aCM or Cl carbonaceous- 
chondrite-like late veneer composition”*"°. Owing to its distinct Ru 
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isotope composition, volatile-rich carbonaceous-chondrite-like mate- 
rial from the outer Solar System was excluded as possible late-veneer 
source material*”, and thus the late veneer seemed unlikely to be the 
primary source of water and volatiles on Earth>”. It should be noted, 
however, that this conclusion depends on the premise that the Ruin 
Earth’s mantle originates solely from the late accreted materials that 
were added after cessation of core formation" >!°!8, If Earth’s pre-late- 
veneer mantle retained a significant fraction of Ru during metal-silicate 
differentiation®”°, as recently suggested, this conclusion would be 
invalid. Investigating Ru isotope signatures in the putative remnants 
of pre-late-veneer mantle would thus not only provide insights into the 
timescales and efficiencies of mixing the late veneer into Earth’s mantle, 
but also introduce constraints onthe composition of the material that 
was added as alate veneer. 

To our knowledge, no unambiguous isotopic evidence for the 
preservation of pre-late-veneer mantle on Earth existed until now. 
For instance, resolvable excesses in*W reported for 3.8 billion-year- 
old (Gyr-old) Archaean rocks from Isua (Greenland) and Acasta (Can- 
ada) in conjunction with relatively low HSE abundances observed in 
3.5-3.2-Gyr-old Archaean komatiites from the Pilbara Craton (Australia) 
and the Barberton greenstone belt (South Africa) were interpreted 
to reflect sluggish mixing of the late veneer into the early Archaean 
mantle”, However, it was later suggested that the mantle sources 
of the 3.8-3.7-Gyr-old Isua supracrustal belt (ISB) rocks, including 
3.8-Gyr-old Eoarchaean peridotites from the Narssaq ultramafic body 
(NUB) andthe south of the Isua supracrustal belt (SOISB), already had 
HSE abundances at about 60-100% of the modern mantle value’*”?. 
This suggests that the late veneer was to a large extent mixed into the 
ambient mantle by ~3.8 billion years ago (Ga). To reconcile *W excesses 
with the presence of modern-mantle-like HSE abundances, it was pro- 
posed that asmall amount of core material could have been entrained 
into proto-Earth’s mantle as a consequence of the Moon-forming giant 
impact”>**, However, W anomalies could also be generated by early 
mantle differentiation processes during approximately the first 50 Myr 
of the Solar System?” or by core-mantle interactions inthe sources of 
mantle plumes”. In summary, !W and HSE concentration data alone 
fail to provide an unambiguous test of whether pre-late-veneer mantle 
domains were preserved. 

Here we explore the potential use of mass-independent Ru isotope 
variations in terrestrial rocks as atool toinvestigate whether pre-late- 
veneer isotope signatures can be found in the Archaean mantle. While 
the Ru isotope composition of the modern mantle is well constrained”, 
this is not the case for the Archaean mantle. To address this issue, we 
determined the Ru isotope composition for a set of ultramafic rocks 
from different Eoarchaean and Palaeoproterozoic terranes (Extended 
Data Table 1; see Methods for details). We focus on the °Ru/'Ru and 
102Ru/Ru ratios to constrain the Ru isotope compositions of the man- 
tle sources of these rocks because these isotope ratios are measured 
at the highest precision and also show the largest variability among 
meteoritic materials*’’*". The results are reported as € unit (0.01%) 
deviations of mass bias-corrected !°°Ru/Ru and !Ru/™Ru ratios 
from aterrestrial standard. 


Exotic composition of Archaean mantle 


We report Ru isotope data for samples from four different cratons. 
The Ru isotope compositions obtained for ultramafic samples from 
the Pilbara Craton (3.5-3.2 Gyr old), the Superior Province (Abitibi 
greenstone belt, 2.7 Gyr old) and the Kaapvaal Craton (Bushveld Com- 
plex, 2.05 Gyr old) are indistinguishable from the Ru solution standard 
(Fig. 1), indicating that their Ru isotope compositions reflect that 
of the modern terrestrial mantle. By contrast, Eoarchaean 3.8-3.7- 
Gyr-old ultramafic rocks from the North Atlantic Craton, originating 
from various localities of the Itsaq gneiss complex (IGC) in southwest 
Greenland (the NUB, SOISB, ISB and the Ujaragssuit Nunat layered 


IGC, Greenland 
NUB (8.8 Ga) 


Ujaragssuit Nunat (3.8 Ga) 


194856 


194857 


194884C 
194882B 


194907 


ISB (3.7 Ga) 


Seqi, Greenland (>3.0 Ga) 186466 


186479 


Pilbara, Australia (3.5-3.2 Ga) Pil 16-61 


Abitibi, Canada (2.7 Ga) OKUM 


LG6 
UG2 


Modern mantle 


Fig. 1|¢’°°Ru data for Archaean and Palaeoproterozoicrocks, themodern 
mantleand chondrites. The individual results for all analysed samples 
(Extended Data Table 1) are shown with the composition of the modern 
mantle”. The uncertainties for individual data points reflect the external 
uncertainty of the method (2s.d. for samples measured n<4 times) or 95% 
confidence intervals of replicate analyses of agivensample (ifn>4). The mean 
values for 3.8-3.7-Gyr-old Eoarchaean samples from the IGC in southwest 
Greenland and chromitite samples from the Bushveld complex are shown as 
solid vertical black lines. The darker grey and blue areas represent the 
respective 95% confidence intervals; the light grey and blue areas limited by 
dashed lines indicate the2s.d. uncertainty of the mean values. The uncertainty 
for the modern mantle compositionis 2 s.d. (ref."). Numbers on the right of the 
data points refer to the sample identifiers given in Extended Data Table 1. 


intrusion) exhibit a uniform and well-resolved excess in €’°°Ru of 
+0.22 + 0.04 (95% confidence interval, Fig. 1) combined witha smaller 
excess in €!Ru of +0.09 + 0.02 (95% confidence interval, Fig. 2a). 
Chromitites from the younger 3.0-Gyr-old Seqi ultramafic complex in 
southwest Greenland show the same excesses in €°°Ru and e!”Ru. The 
combined ¢!°Ru and ¢!"Ru excesses in these rocks represent mass- 
independent isotope anomalies of nucleosynthetic origin and indicate 
that the Ru in the southwest Greenland mantle source is enriched in 
nuclides produced by the slow neutron capture process (s-process) 
of nucleosynthesis compared with the modern mantle (Fig. 2a). 
The isotope excesses cannot be explained by mass-independent frac- 
tionation effects or by inherited fissiogenic Ru nuclides (see Meth- 
ods and Extended Data for details about the accuracy of the Ru 
isotope data). 
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Fig. 2|Ruisotope plot illustrating compositional differences between 
enstatite, ordinary, average carbonaceous, Cl and CMcarbonaceous 
chondrites, the modern mantle and the Eoarchaean mantle. a, The dashed 
line represents a mixing line between the modern mantle composition 

(e°Ru = 0) and an s-process component defined by Ru isotope data for pre- 
solar silicon carbide grains**. The compositions of enstatite chondrites (EC, 
€°°Ru =-0.08 + 0.04, 95% confidence interval); ordinary chondrites (OC, 
e°Ru=-0.29 + 0.03, 95% confidence interval)*, CI chondrites (CI, 
e€Ru=-0.24+0.13,2s.d.)°;CM chondrites (CM, ¢°°Ru = -0.69 + 0.38, 95% 
confidence interval)’, and average carbonaceous chondrites (average CC, 
€°°Ru =-0.90 + 0.12, 95% confidence interval)> are shown for comparison. The 
uncertainties for CI chondrites reflect a single measurement and arethus 
shown with the external uncertainty of the method (2s.d. as stated in ref.°). 
Uncertainties for the modern and the Eoarchaean mantle composition are the 
sameasstated in Fig. 1. Note that the uncertainty for the modern oceanic 
mantle composition from the literature is shownas 2s.d. (ref. 2). b, Heliocentric 
zoning of e°°Ru anomalies’.The presence of an s-process-enriched reservoir 
that contributed to Earth’s growthisinferred fromthe Ruisotope composition 
obtained for the Eoarchaean mantle of southwest Greenland (Fig. 1). Chondrite 
groups formedat increasing heliocentric distances exhibit more negative 
e°°Ru because they are more depleted ins-process Ru relative to Earth’s 
modern mantle’. The e'°°Ru uncertainty for carbonaceous chondrites in bis 
shownas 2s.d.to account for the significant within-group variation of their 
e'°Ru values (image adapted from ref. °, Springer Nature). 


The s-process-enriched compositioninferred forthe Archaean south- 
west Greenland mantle is an unexpected finding because the Ru isotope 
compositions reported for all meteorites are deficient in s-process Ru 
and exhibit negative °Ru and ¢!Ru anomalies relative to Earth’s mod- 
ernmantle**”", The southwest Greenland data provide unambiguous 
evidence for s-process-enriched building material that contributed to 
the early stages of Earth’s growth. Owing to the observed heliocentric 
zoning of €°°Ru anomalies among meteorites’, we speculate that this 
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reservoir was most probably located in the innermost region of the 
Solar System, within 1 astronomical unit(Fig. 2b). 


Pre-late-veneer Ru isotopic signature 


The Ru excess provides unequivocal evidence that the mantle source 
of the Greenland rocks did not receive the full complement of late 
veneer material”. Furthermore, it also requires that Ru (and possibly 
other HSEs) was not completely stripped from the mantle during the 
latest stages of core formation”. Otherwise, no Ru isotope anomaly 
would be observed. The uniform and ubiquitous presence of thee!°°Ru 
anomaly in various 3.8-3.7-Gyr-old ultramafic rock types from different 
Eoarchaean terranes in Greenland (Isuakasia, Feringehaven) suggests 
that a larger mantle domain is lacking a full late veneer component?>”. 
The presence of the e°°Ru anomaly inthe younger Mesoarchaean chro- 
mitites from Seqi (minimum age of 3.0 Gyr, Akia terrane) also indi- 
cates that even 700 Myr later, the southwest Greenland mantle had not 
fully equilibrated with the late veneer. Sucha prolonged timescale for 
mixing-in of the late veneer componentis consistent with significant 
HSE depletions observed in Archaean mafic rocks from the Pilbara and 
Kaapvaal cratons”, which previously had been explained by sluggish 
inmixing of late veneer material. 

As outlined above, the €’°°Ru excess identified in Eoarchaean ultra- 
mafic rocks from southwest Greenland indicates that Ru was not 
completely sequestered in the core, most probably because some late 
accretionary component had been delivered during the waning stages 
of core formation". Depending on the composition of this early 
late veneer material, the °Ru excess measured in the Greenland rocks 
would then represent a minimum estimate for the e’°°Ru excess of the 
pure pre-late-veneer mantle. The nature of the early component that 
supplied the °Ru excess and was already mixed into the Greenland 
mantle before 3.8 Ga, probably inner Solar System material (Fig. 2b), 
can be further constrained by osmium isotope systematics. This is 
because the initial osmium isotopic compositions of chromitite and 
peridotite samples from the IGC overlap the ®”Os/*8Os composition of 
chondrites at 3.8 Ga (refs. #73233) (Extended Data Table 2). Assuming 
that the positive ¢°Ru anomaly and the chondritic Os signature were 
both imparted by this component, it is unlikely that it is represented 
by any known chondritic meteorites because these all exhibit negative 
e°Ru values and chondritic Os isotope compositions. Importantly, 
owing to its positive ¢°Ru value, this material cannot derive from a 
carbonaceous-chondrite-like Moon-forming impactor® because car- 
bonaceous chondrites also exhibit the most negative ¢!°°Ru values 
among all known chondrite groups’ (Fig. 2). 


Carbonaceous-chondrite-like late veneer 


Regardless of the precise nature and origin of the early accreted compo- 
nent, the °Ru excess inferred for the Eoarchaean southwest Greenland 
mantle source could only be balanced by the addition of chondritic 
materials with negative ¢°Ru to yield the composition of the mod- 
ern mantle (¢°°Ru = 0). This mixing relationship is further illustrated 
in Fig. 3, where possible ¢°°Ru compositions for the pre-late-veneer 
mantle are calculated by subtracting enstatite, ordinary or carbona- 
ceous-chondrite-like materials from the Ruisotopic composition of the 
modern mantle. The model is based on a recently proposed inefficient 
core formation scenario where about 20% of the Ru (-1.4 ng g“) inthe 
modern mantle derives from the pre-late-veneer stage”. Assuming 
a minimum late accretion contribution of 60% for the >3.8-Gyr-old 
Itsaq mantle source", only the addition of a late veneer consisting of 
carbonaceous chondrites could account for €°Ru = 0, as observed for 
the modern mantle” (Fig. 3). The required proportion of late accreted 
material would amount to amaximum estimate of 0.3% of Earth’s mass 
of average carbonaceous chondrite or CM carbonaceous chondrite 
material, consistent with a recent estimate based on Se isotopes”®. 
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A late veneer consisting of CM-like material is also supported by the 
abundances of volatile elements in the silicate Earth?*”°. This conclu- 
sion remains robust, even if a larger fraction of Ru was present in the 
pre-late-veneer mantle or the late accretion component inthe mantle 
by 3.8 Ga was >60%, but in these cases alower carbonaceous chondrite 
mass fraction would be sufficient. Ordinary chondrites would only 
become viable late veneer materials ifthe Greenland mantle contained 
a significantly lower late veneer contribution by 3.8 Ga (<50%). A late 
veneer consisting of carbonaceous chondrites is consistent with the 
relative abundances of S-Se-Te and the Seisotopic composition of the 
modern mantle”*””, but the addition ofa late veneer composed of ordi- 
nary chondrites cannot be reconciled with these constraints, because 
the relative abundances of S-Se-Te and the Se isotope composition of 
ordinary chondrites are distinct from those of Earth’s mantle”. If a 
major part of the late veneer consisted of core fragments from differ- 
entiated impactors”, one potential caveat would be that this material 


Fig. 3| Model estimates for the amount of late veneer added tothe 
Eoarchaean mantle based on Ruisotope compositions. a-—c, The model 
illustrates the effect of subtracting variable mass fractions of late veneer from 
the Ru isotope composition of the modern mantle. The composition of the 
modern mantle was fixed at €°Ru = 0, as indicated by the composition 
obtained for samples from the Bushveld complex (Extended Data Table 1), 
whichis indistinguishable from previously reported data for the modern 
oceanic mantle composition”. Dotted lines indicate the respective mass 
fractions of different chondritic late-veneer compositions subtracted from the 
modern mantle composition. Solid black lines show the minimum and 
maximum e°Ru values for different chondrite classes in a: enstatite 
chondrites (EC, e°°Ru = -0.08 + 0.04, 95% confidence interval) and ordinary 
chondrites (OC, e°Ru =-0.29 + 0.03, 95% confidence)°; b: carbonaceous 
chondrites (CC, e°Ru=-0.90 + 0.61, 2s.d.; +0.12, 95% confidence interval)° 
and Clchondrites (CI, ¢°°Ru =-0.24 +0.13, 2s.d.)5;and c: CM chondrites (CM, 
© Ru =-0.69 + 0.38, 95% confidence interval)*. The uncertainties for 
carbonaceous chondrites in bare given as 2s.d.to account for significant 
within-group variation in their ¢°°Ru values (the dashed line indicates 95% 
confidence interval uncertainty). The2s.d. uncertainty for CI chondrites 
reflects the external uncertainty of the method (as statedin ref.*°). The amount 
of subtracted late veneer material for each respective chondrite composition 
was adjusted to match a presumed Ru concentration in the pre-late-veneer 
mantle of -1.4ngg"™, corresponding to -20% of the Ru contained in the modern 
mantle. The yellow boxes indicate the composition of the Eoarchaean mantle 
inferred from the mean value of all analysed 3.8-3.7-Gyr-old samples 
originating from various localities of the IGC (e°°Ru=+0.22 + 0.04, average 
value shown as solid black line, dashed lines indicate 95% confidence interval 
uncertainty, Extended Data Table 1). The solid vertical dashed line indicates the 
minimum late veneer contributioninferred for the mantle source of peridotite 
samples from the NUB and the SOISB based on previously reported *’0s/"8Os 
data and HSE concentrations“. The parameters used for the mixing model are 
givenin Extended Data Table 3. 


cannot readily account for chondritic S-Se-Te and broadly chondritic 
HSE relative abundances in Earth’s mantle”*"*"8, 

Collectively, our data imply that the distinct °°Ru isotope excess in 
the Eoarchaean southwest Greenland mantle sourceis best explained by 
late mixing of a carbonaceous-chondrite-like late veneer fractioninto 
Earth’s mantle. Thus, contrary to previous Ru isotope constraints on 
thelate veneer>”, these data imply that significant amounts of volatile- 
rich outer Solar System materials including water and volatiles were 
added with the late veneer. This revised view also agrees with other 
constraints, such as those independently obtained from the relative 
abundances and isotope compositions of Earth’s volatile elements'?*"”, 
which also indicate that the major share of Earth’s volatiles was 
inherited from a carbonaceous chondrite source’ *. Finally, our data 
demonstrate that investigating the Ru isotope composition of terres- 
trial rocks represents a powerful analytical tool for identifying primor- 
dial mantle heterogeneities arising from incomplete equilibration of 
the ambient mantle with Earth’s late-stage building blocks. 
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Methods 


Samples 

The samples analysed in this study comprise ultramafic rocks from 
four different cratons. The North Atlantic Craton is represented by 
peridotite and chromitite samples of from various localities of the 
Eoarchaean IGC and the Mesoarchaean Seqi ultramafic complex in 
southwest Greenland. The sampleset is complemented by chromitites 
from the Kaapvaal Craton (Bushveld Complex, South Africa) and the 
Pilbara Craton (Australia) and a komatiite reference sample from the 
Superior Province (Abitibi greenstone belt, Canada). Where available, 
the Ru concentration and osmium isotope data for the samples ana- 
lysed in this study are given in Extended Data Table 2. There were no 
data available for some samples, so Ru and Os data obtained for similar 
samples from the same location are listed. 

The IGC in southwest Greenland represents one of the few locali- 
ties where remnants of Eoarchaean mantle are preserved. The IGC 
comprises two Eoarchaean crustal terranes (Isuakasia and Feringe- 
havn), where possible mantle rocks are exposed as ultramafic lenses 
in the 3.8-3.7-Gyr-old Isua supracrustal belt and ultramafic bodies in 
the 3.8-Gyr-old SOISB, both located inthe Isuakasia terrane®>”*. In the 
Feeringehavn terrane, suchrocks are exposed inthe 3.8-Gyr-old NUB”. 
The peridotite and chromitite samples investigated in this study were 
selected to cover all of these different localities. 

Samples 10-9 and 10-11 from the NUB are massive, coarse-grained 
peridotites. Olivine is the dominant phase in these rocks. They also 
contain orthopyroxene, amphibole, spinel and magnetite”. The chemi- 
cal compositions of these samples, including concentration data for 
highly siderophile elements and '*’Os/*°Os data, were reported ina 
previous study. Sample 10-27 is a harzburgite from the SOISB. The 
mineral assemblage of this rock is comparable to other harzburgites 
collected from the same locality. These rocks are typically spinel- 
peridotites with harzburgitic mineral assemblages composed mainly 
of olivine and variable amounts of orthopyroxene, amphibole and 
opaque phases“. A minimum age of 3.8 Gyr was estimated for the 
analysed SOISB and NUB peridotites on the basis of field relation- 
ships with surrounding 3.8-Gyr-old tonalitic gneisses and crosscutting 
dykes*3637, 

Two ofthe investigated chromitites (194856, 194857) were collected 
from the Ujargssuit Nunat layered intrusion®®. For chromitites from this 
locality, Pt-Os model ages as old as 4.36 Gyr were reported”. Samples 
194882B and 19488C are chromitites froma locality close tothe inland 
ice that most probably belongs to the same sequence as the chromitites 
from the Ujargssuit Nunat layered intrusion. 

The dunite sample 194907 was collected from an antigorite lens 
located within the northeastern part of the 3.7-Gyr-old Isua supracrustal 
belt, which has previously been referred to as Dunite Lens B*°*!. The 
ISB dunites also contain orthopyroxene and spinel, and very minor 
amounts of clinopyroxene that has mostly been altered*°. 

Two of the analysed chromitites (186466, 186479) derive from the 
Seqi ultramafic complex. The major and trace element compositions, 
including concentration data for platinum group elements of these 
samples, were reported ina previous study”. The Seqi ultramafic com- 
plex represents a peridotite enclave hosted by tonalitic orthogneiss 
within the 3.0-Gyr-old Akia terrane. Aminimum age for the ultramafic 
body is constrained by 2.98-Gyr-old crosscutting granitoid sheets”, 
although unpublished Re-Os isotope data show a consistent 3.1 Gyr 
mantle depletion age for the Seqi ultramafic complex. The highly 
refractory peridotites and chromitites are interpreted as represent- 
ing the remnant of a fragmented layered complex or amagma conduit. 
The ultramafic rocks formed froma magnesian-rich, near-anhydrous 
magma as olivine dominated cumulates with high modal contents of 
chromite. Their parental magma was generated by high degrees of 
partial melting ofa mantle source that probably represents the precur- 
sor of the regional sub-continental lithospheric mantle. 


The chromitite sample Pil 16-61 was collected from the Warawoona 
Group located within the Pilbara Craton. The chromitite may beas old 
as the associated Mount Ada basalt unit (3.5 Gyr) or it may be part of 
the younger 3.2-Gyr-old Dalton Suite sill complex®. 

The investigated komatiite rock (OKUM) from the 2.7-Gyr-old Abitibi 
greenstone belt (Canada) is acommercially available rock reference 
sample provided by the International Association of Geoanalysts. 

Twochromitites (UG2, LG6) from the 2.05-Gyr-old Bushveld complex 
(South Africa) were used as areference sampleto validate the analytical 
method and toassess the precision and the accuracy of the Ruisotope 
measurements. The Ru isotope composition of UG-2 was previously 
determined employing a different digestion method (alkaline fusion)°. 


Ruthenium separation and purification 

The required amount of sample material to yield sufficient Ru for a 
high-precision measurement was estimated on the basis of previously 
reported Ru concentrations (10-9, 10-11, UG-2, LG-6, OKUM, 186466, 
186479) or Ru concentrations reported for samples of similar 
composition fromthe same locality (10-27, 194907)7***. When informa- 
tion was not available (for example, for samples 194856, 194857and 
Pill6-61), the Ru concentrations were determined from a1 g powder 
test portion digested in a high-pressure asher in reverse aqua regia 
(5 ml concentrated HNO, and 2.5 ml concentrated HCl). Before quan- 
tification of Ru for these samples, the digestion solution was dried 
down, converted twice with 5 ml of 6 M HCI, taken up in 0.2 M HCl and 
loaded on a cation column to remove matrix elements as described 
below. Ruthenium concentrations were determined in the eluted Ru 
fractions by external calibration using a quadrupole inductively cou- 
pled plasma mass spectrometer (ThermoScientific iCap). We note 
that the concentrations determined by this procedure may underesti- 
mate the actual concentration of samples because some Ru may have 
been lost as a volatile tetroxide (RuO,) when the aqua regia solutions 
were dried down. These concentrations are therefore considered to 
be only approximate values. In a similar manner, the Ru contents of 
two chromitites (194882B, 194884C) were estimated from a1% sam- 
ple aliquot taken after NiS digestion and cation column chemistry as 
described below. However, we note that these estimates represent only 
approximate values too because the Ru yield of the NiS procedure is 
<100%. 

For the NiS procedure, powder aliquots of 5-10 g were digested using 
a NiS fire assay technique”. For chromitite samples with high Ru con- 
centrations (UG-2, LG-6, 194856, 194857, Pil16-61), one NiS digestion 
with5 g of sample powder was needed to yield sufficient amounts of Ru. 
Multiple NiS digestions with 10 g of sample powder had to be prepared 
for ultramafic samples with lower Ru concentrations (harzburgites, 
dunites, komatiites and some chromitites). The total number of NiS 
digestions andthe amount of sample material used for each respective 
NiS bead are given in Extended Data Table 2. Appropriate amounts of 
Ni,S, borax and Na,CO, were added toeach5-10 gsample portion and 
thoroughly mixed. The mixture was fluxed in a muffle furnace for 75 min 
at 1,000 °C. After cooling, the NiS beads were physically removed from 
the quenched silicate melt. 

For the majority of samples, the NiS procedure resulted in about one 
to three beads of about Icm in diameter that could readily be recov- 
ered fromthe quenched silicate. The Ru yield of the NiS procedure was 
determined on the basis of sample powders with known Ru concentra- 
tions (UG-2, OKUM, 10-9, 10-11, 186466, 186479). The Ru yield for these 
samples usually varied from 60-95%. However, incase of three replicate 
UG-2 digestions the Ru yields were only of the order of 10-20%. The 
lower yields resulted from incomplete homogenization and subsequent 
inefficient extraction of NiS beads from the quenched silicate. The NiS 
digestions for these samples produced finely dispersed millimetre- 
to micrometre-sized spherules within the quenched silicate. Careful 
homogenization of the NiS sample-flux mixtures before digestion 
helped to avoid this problem. 
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The NiS beads were crushed in an agate mortar and transferred into 
60 ml Savillex beakers to which 30 ml of concentrated HCI was added. 
The solutions were evaporated to near dryness ona hotplate at 100 °C. 
This step was repeated with another 30 ml of concentrated HCl and 
20 ml of 1MHCI. 

Ruthenium was separated from the dissolved NiS beads using cation 
exchange chromatography“. Each dissolved NiS bead from asingle fire 
assay digestion was split over three cation columns filled with 10 ml 
AGSO X8 resin, respectively. The resin was equilibrated with 20 ml of 
0.2 MHCI. Ruthenium and other platinum group elements were loaded 
and eluted in 14 ml of 0.2 MHCI. The eluted Ru fractions from each 
sample were recombined and a small aliquot (1%) was taken to deter- 
mine the amount of Ru and remaining matrix elements (mainly Ni). If 
significant amounts of matrix elements passed through the column (if 
Ni/Ru > 1),the combined fractions of samples were passed for asecond 
time over a single 10 ml cation column. The Ru yields from the cation 
column were usually >95%. The eluted sample solutions were dried 
down ona hotplate, recombined and Ru was further purified using a 
macrodistillation unit as described elsewhere**. After the distillation, 
the purified Ru fractions were dried down ona hotplate and dissolved 
in0.5 ml of 0.28 MHNO,, from whicha small aliquot was prepared asa 
predilution to determine the Ru yield and to check for potential inter- 
fering elements. The distillation yields were usually between 40 and 
80%. The total Ru yield of the analytical procedure, including NiS diges- 
tion, column chemistry and distillation, is typically 30-70%, estimated 
from samples with known Ru concentrations (UG-2, OKUM, 10-9, 10-11, 
186466, 186479). The total yield of the three UG-2 digestions was only 
6-21%. The yields of the distillation for these samples were 50-80%, so 
thelowtotal Ru yields are caused by inefficient extraction of NiS beads, 
as described above. However, neither the total Ru yield of the entire 
analytical procedure nor the respective yields from the NiS digestion 
or the Ru distillation have any effect onthe accuracy of the Ru isotope 
data (Extended Data Fig. 1). 

The procedural blank for a single NiS digestion, including column 
chemistry and distillation, varied between 185 and 435 pg (n =3). The 
blank contribution was «1% for the majority of samples and <2% for 
OKUM and 194907 given that >30 ng of Ru were processed for each 
respective NiS digestion. 


Mass spectrometry 

The Ru isotope measurements were performed using a ThermoSci- 
entific Neptune Plus multicollector inductively coupled plasma mass 
spectrometer in the Institut fiir Geologie und Mineralogie at the 
University of Cologne. For the measurements, the Ru fractions were 
further diluted in 0.28 M HNO, to yield Ru solutions of 100 ng mI". 
The diluted solutions were checked for the presence of interfering 
elements (Zr, Ni) that could affect the accuracy of the isotope data and 
cannot be monitored online during the measurements. The sample 
solutions were introduced into the mass spectrometer at an uptake 
rate of -50 pl min“ using an ESI microflow PFA nebulizer attached toa 
Cetac Aridus II desolvator. The isotope measurements were conducted 
with total ion beam intensities between 8 x 10“ and 2 x10” A, obtained 
for 100 ng mI Ru sample and standard solutions using conventional 
NiH-cones. The set-up was optimized to yield oxide production rates 
«1% (CeO/Ce). The measurements were conducted in static mode and 
the seven stable Ru isotopes (Ru, *°Ru, ”’Ru, !°°Ru, !Ru, '°?Ru and 
104Ru) as well as Mo and Pd were monitored simultaneously. Each 
Ru isotope analysis consisted of an on-peak baseline on a solution 
blank (40 integrations of 4.2 s) followed by 100 integrations of 8.4's 
for each sample or standard solution and typically consumed about 
90 ng Ru. Each sample analysis was bracketed by measurements of an 
in-house Ru standard solution (Alfa Aesar Ru). The data were internally 
normalized to *Ru/'Ru = 0.7450754 (ref. *) using the exponential law 
to corrected for mass-dependent isotope fractionation. The isotope 
data are reported as e'Ru = ([(Ru/ Ru sampie)/(CRU/ RU standara) | ~ 1) x1 


0*, calculated relative to the bracketing standard of each analytical 
session. The accuracy and precision of the Ru isotopic measurements 
were evaluated by replicate digestions and multiple analyses of the 
UG-2 chromitite (Bushveld), which was used as a reference sample. 
The Ru isotope data obtained for UG-2 in this study agree well with 
previously reported data’, where a different digestion method (alka- 
line fusion) was used for sample decomposition. This demonstrates 
that the isotope data obtained by the NiS method yield accurate 
results. The external reproducibilities (2 s.d.) obtained for a total 
number of 103 individual measurements from 8 replicate UG-2 diges- 
tions are +0.43 €°Ru, +0.49 €°Ru, +0.12 €°Ru, +0.16 €’Ru and 
+0.30 !4Ru. 


Correction for mass-dependent isotope fractionation 
The exponential law is one of the most commonly used methods to 
correct for natural and instrumental mass-dependent isotope frac- 
tionation. One potential caveat in using this correction for Ruisotope 
measurements could be that the distillation technique used to purify 
the Ru could induce an isotope fraction that would not follow the 
exponential law. This could cause apparent isotopic anomalies for a 
given sample as a consequence of inaccurate mass fractionation cor- 
rection. The exponential law assumes that the logarithmic fractiona- 
tion 6 = In(r/R) of a given isotopic ratio is expressed as a function of 
the mass log difference A(InM) = In(M,/M,). Considering two isotopic 
ratios (r=°’Ru/Ru and r’= !°°Ru/!“Ru) the exponential law predicts 
that mass fractionation produces a linear array inaIn(r/r) plot’’. This 
is illustrated in Extended Data Fig. 2 for the measured raw ratios of 
*Ru/ Ru andRu/Ru. The ratios in this figure are not corrected for 
mass fractionation and are normalized to a reference ratio (Rand R’, 
respectively)”. If the mass fractionation experienced by the samples 
is accurately described by the exponential law, the ratios should fall on 
alinear array with a slope of -0.5. Two distinct mass fractionation lines 
can be observed in the plot for different sessions. The slopes for both 
groups of sessions are indistinguishable within error and are in very 
good agreement withtheslope predicted by the exponential law. Most 
importantly, the samples purified by distillation fall on the same respec- 
tive mass fractionation line as their associated Alfa Aesar bracketing 
standards. This clearly demonstrates that the Ru distillation does not 
induce any non-exponential mass-fractionation effects for the samples 
in comparison with the bracketing standard. This observation is also 
independent from the Ru yield of the samples and does not change if 
other Ruisotope ratios are considered. Thus, the Ru isotope anomalies 
obtained for the southwest Greenland samples cannot reflect inaccu- 
rate mass fractionation correction. The shift observed for samples and 
associated standards plotting ona distinct mass fractionation array 
in Extended Data Figure 2 was caused by maintenance in May 2019 
during which a Faraday cup was replaced. However, because the data 
are reported as relative deviations in parts per 10‘ from the Alfa Aesar 
bracketing standard, and because samples and bracketing standards 
are shifted by the same magnitude, this does not affect the accuracy 
of the data. This is also confirmed by replicate digestions of sample 
10-9 that were analysed in both groups of sessions. The €°Ru values 
for this sample are indistinguishable within analytical uncertainty. 
Another argument against non-exponential mass-fractionation 
effects is that nucleosynthetic Ru isotope anomalies caused by vari- 
able contributions of s-process Ru nuclides would not lead to any ¢'*Ru 
anomalies in the ’Ru/‘“Ru normalization scheme. As the e'™Ru values 
for all analysed samples fall within the external reproducibility of the 
method (+0.30 for ¢!°*Ru), this demonstrates that sample distillation 
does not cause non-exponential mass fractionation effects. 


Isobaric interferences 

The accuracy of the Ru isotopic measurements could be compro- 
mised by isobaric interferences from Mo, Pd, Zr and Ni argide spe- 
cies, or potential effects relating to remaining S in the analysed sample 


solutions. While interferences from Mo and Pdare simultaneously moni- 
tored and corrected for during the measurements“, isobars from Zr 
and Ni argides are not. Owing to the design of the collector block and 
limited availability of collectors, Zrand Ni could not besimultaneously 
monitored during the measurements. However, Zr is very effectively 
separated from Ru by cation exchange chemistry. Hence, all analysed 
sample solutions (except for one digestion of sample 10-27) had Zr 
intensities indistinguishable from the background of the 0.28 MHNO, 
and the Ru standard solution. Only one analysed sample (10-27) had 
a slightly elevated Zr/Ru ratio of 0.0008 and, hence, its e?°Ru value is 
slightly elevated due to an isobaric interference from *°Zr that could 
not be corrected. The Zr contained inthe one analysed sample solution 
most probably reflects a random contamination from the laboratory 
equipment used during sample preparation that was not observed for 
other samples. 

Inthe case of Niwe noticed during the initial stage of the project thata 
few processed reference samples still contained considerable amounts 
of Niafter the cation chemistry. For these samples, even after further 
purification of Ru by distillation, smaller amounts of Ni (between 1 
and 10 ng mI’) were observed in the sample solutions to be analysed. 
During the isotopic measurements Ni readily forms argide species in 
the plasma that interfere with Ruisobars™. To assess potential effects 
from Ni argide species on the measured Ru isotope data,al00ng mI 
Ru standard solution was doped with varying amounts of Ni to yield 
concentrations between 0.2 pg ml and 50 ng mI". The results of this 
test showthat the measured Ru isotope compositions for 100 ng mI“ Ru 
solutions are not affected for samples with Ni/Ru ratios <0.01 (Extended 
Data Fig. 3a, b). For sample solutions with higher amounts of Ni, posi- 
tive anomalies are observed, which are most pronounced for ’Ru and 
toa lesser extent for !°°Ru. Other Ru isotope masses (“°7Ru and!“Ru), 
owing to the lower abundance of the higher mass Ni isotopes, are not 
significantly affected by Ni argide species. To avoid any interferences 
from Niargides during the isotopic measurements, the final dilutions 
of all samples analysed in this study were carefully checked for their Ni 
contents before the analysis. The intensity of Ni, monitored by scanning 
the mass of *$Ni, in the finally diluted sample solutions was indistin- 
guishable from the background intensity observed for the Ru solution 
standard and for 0.28 M HNO, (10-30 mV on *8Ni). These negligible 
amounts of Ni are insignificant and have no effect on the measured 
data. The minimal Ni background originates from the Ni cones of the 
experimental set-up. 

To eliminate any potential effects of Sin the analysed solutions onthe 
isotopic measurements, the S from the crushed NiS beads was almost 
completely removed by evaporation as H,S gas during dissolution of 
the beads with concentrated HCI. After further purification of Ru by 
column chemistry and distillation, the S contents in the final sample 
solutions were <25 ng mI“ for all analysed samples. Tests with S-doped 
Alfa Aesar Ru standard solutions showed that even ifa100 ng mI Ru 
standard solution contains large excesses of S (S/Ru=5), the accuracy 
of the Ru isotope measurements is not compromised (Extended Data 
Fig. 3c, d). 


Nuclear field shift 

Previous studies have shown that mass-independent Ru isotope 
anomalies could be caused by nuclear field shift-induced fractionation 
effects*!. In meteorites and their components, such effects could bea 
primary feature resulting from evaporation/condensation processes. 
However, experimental studies have shown that mass-independent 
effects couldalso be generated in the laboratory during sample prepa- 
ration*, In this section, we explore the potential effects of nuclear field 
shift-induced fractionation of Ru isotopes. These fractionations can 
be predicted on the basis of differences in the mean-squared nuclear 
charge radii between nuclides of agiven element. The resulting effects 
onthe measured Ru isotopic composition can be calculated in € units 
using the following equation”: 


m,(m,- mM) Ja 
my,m, 


ow fede 
ne (Sm m,(m,— m,) 


where m,and m, are the atomic masses of the two isotopes of an ele- 
ment chosen for internal normalization, m; refers to the atomic mass 
of another isotope indexed with variablei, 5r? denotes the difference 
inthe mean-squared nuclear charge radii of the respective isotope pair 
and ais an adjustable parameter that determines the magnitude of 
mass-independent fractionation, which is a function of temperature 
Tas 1/T. In plots of e°?Ruand e”Ru versus €!°Ru (Extended Data Fig. 4), 
theslopes calculated for the nuclear field shift fractionation are clearly 
distinct from the slope predicted bya variation in s-process Ru nuclides. 
The Ru isotopic composition obtained for Eoarchaean southwest 
Greenland rocks does not plot onthe calculated slope for nuclear field 
shift but instead plots onthes-process mixing line. As such, the anom- 
alies identified in the southwest Greenland rocks cannot be explained 
by nuclear field shift-induced fractionation and therefore reflect iso- 
tope anomalies of nucleosynthetic origin. 


Fissiogenic Ru 

The spontaneous fission of uranium has been shown to produce Ru 
nuclides with relative abundances that are distinct from naturally occur- 
ring Ru (ref. °”). Fissiogenic Ru primarily consists of Ru (33.4%), “Ru 
(28.9%), }°?Ru (24.7%) and ‘Ru (12.4%). The presence of an inherited 
fission-produced fraction of Ruinarock sample would induce a charac- 
teristic isotope anomaly pattern that would be distinct from anomalies 
of nucleosynthetic or nuclear field shift origin. Because *°Ru and !°°Ru 
are nota significant fission product*’, the presence of an inherited 
fraction of fissiogenic Ruina rock sample would cause negative e”°Ru 
and e°Ru anomalies. These are not observed in any of the analysed 
samples. Onthe other hand, a deficit of such an inherited fissiogenic 
Ru component would yield positive e”°Ru and €°Ru anomalies, which 
are also not observed. This is shown in Extended Data Fig. 4b, where 
samples with an excess ora deficit of sucha fissiogenic Ru components 
would fall on a mixing line with a distinct slope. Hence, the isotopic 
composition of the samples with a positive ¢°°Ru anomaly cannot be 
explained by either an excess or a deficit of fissiogenic Ru nuclides. 


Data availability 


The data that support the findings of this study are available from the 
EarthChem library (https://doi.org/10.1594/IEDA/111462). Source data 
for Figs. 1-3 and Extended Data Fig. 1-4 are provided with the paper. 
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Extended Data Fig. 1| Ruthenium yields of the total analytical procedure 
and Ru yields fromthe distillation plotted against <’°°Ru data for replicate 
digestions of UG-2 and southwest Greenland samples. a, b, The accuracy of 
the e°Ru data are not affected by the Ru yield of the total analytical procedure 
(a) or the Ru yield of the distillation (b). The grey and blue areas represent the 
95% confidence intervals, and the light grey and light blue areas limited by 
dashed lines indicate the 2s.d. uncertainty as stated for the calculated mean 
values of the Bushveld igneous complex and the IGC (Extended Data Table 1). 
Note that it was not possible to determine the Ru distillation yield for all 
replicates of the analysed samples in b for the southwest Greenland samples. 
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Extended Data Fig. 2 | Ruthenium isotope plot showing the In values of the 
measured raw ratios for °°Ru/!“Ru and ?°Ru/"”Ru obtained for 100 ng mI 
Ru sample solutions and associated Alfa Aesar Ru bracketing standards. The 
measured isotope ratios are shown as raw ratios uncorrected for mass- 
dependent fractionation and arenormalizedto reference ratios R and R’. Two 
distinct mass fractionation lines can be observed. The slopes for both lines are 
indistinguishable within error and are in very good agreement with the 
predicted slope of 0.5, which would be expected if the exponential law could 
accurately correct the mass-dependent fractionation (see Methods for details). 
Most importantly, the samples purified by distillation fall onthe same 
respective mass fractionation line as their associated Alfa Aesar bracketing 
standards. This clearly demonstrates that the Ru distillation does not induce 
any unaccounted mass-fractionation effects for the samples in comparison 
withthe bracketing standards. The shift observed for samples and associated 
standards plotting ona distinct mass fractionation array was caused bya 
maintenance on the mass spectrometer in May 2019, during which a Faraday 
cup was changed. However, this does not affect the accuracy of the isotopic 
data because the analysed samples and their associated bracketing standards 
are shifted by the same magnitude, and the isotopic data are expressed as € 
values from the bracketing standards. 
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Extended Data Fig. 3 | Ruthenium isotope data obtained for 100 ng mI‘ Alfa 
Aesar Ru standard solutions doped with variable amounts of NiandS. 

a,b, The effect of Niargide interferences on the measured €°Ru (a) and e**Ru 
(b) isotopic compositions of a 100 ng mI Ru standard solution doped with 
varying amounts of Ni. The accuracy of the measured isotopic compositions is 
notaffected for samples with Ni/Ru <10~.c, d, High amounts of S (S/Ru=5) in 
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theanalysed sample solutions have no effect onthe accuracy of the measured 
e'°Ru (c) and e*Ru (d) data. Vertical dashed lines inc and d indicate the range 
of S/Ru ratios in the analysed samples. The blue areas indicate the external 
reproducibility of the method as defined by 2s.d. of the replicate digestions 
and repeated analysis of the UG-2 reference sample (see Methods). 
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Extended Data Fig. 4| Rutheniumisotope plots illustrating systematic 
compositional differences between EC, OC and CI, modern mantle andthe 
pre-late-veneer mantle in comparison with mixing lines calculated between 
the modern mantle composition and isotopic variations caused by variable 
contributions of s-process Ru nuclides, afissiogenic Ru component and 
variations of nuclear field-shift induced isotope fractionations. 

a, Ruthenium isotope data for the Eoarchaean mantle of southwest Greenland, 
the modern mantle and the three types of carbonaceous chondrite in 
comparison witha mixing line calculated between the composition of the 
modern mantle (e’Ru = e¢!°Ru = 0) and ans-process component* (dashed line) 
and the slope calculated for a nuclear field shift-induced (NSF) fractionation™ 
(solid black line). b, Ruthenium isotope composition of the Eoarchaean mantle 
in comparison with mixing lines calculated between the composition of the 
modern mantle (e?°Ru = €!°Ru = 0) and afissiogenic Ru component™ (dotted 
line), an s-process component™ (dashed line) and the slope calculated for an 
NSF isotope fraction® (solid black line). 


Extended Data Table 1| Ruthenium isotope data for ultramafic rocks 


Sample* Rockt nt e%®Rus + e%Rus + e100Rus + €102Rus + €104Rus + 
Greenland 


Itsaq gneiss complex (IGC) 
Narssaq ultramafic body (NUB), 3.8 Gyr 


10_9 H 1 0.13 0.43 -0.04 0.49 0.19 0.12 0.10 0.16 0.17 0.30 
replicate H 2 0.28 0.43 0.42 0.49 0.30 0.12 0.13 0.16 -0.01 0.30 
10_11 D 2 -0.07 0.43 0.29 0.49 0.34 0.12 0.08 0.16 -0.10 0.30 
South of the Isua supracrustal belt (SOISB), 3.8 Gyr 

10_27 H 1 (1.19)1 0.43 0.33 049 0.23 0.12 0.03 0.16 0.04 0.30 
replicate 1 0.03 0.43 0.58 049 0.23 0.12 0.18 0.16 0.00 0.30 
Ujaragssuit Nunat complex, 3.8 Gyr 

194856 Chr 6 -0.04 0.20 0.18 0.17 0.17 0.09 0.06 0.06 -0.10 0.08 
replicate 5 -0.27 0.29 0.04 0.35 0.15 0.11 0.07 0.04 -0.03 0.13 
replicate 3 -0.25 0.43 0.40 0.49 035 0.12 0.13 0.16 -0.11 0.30 
194857 Chr 10 = -0.20 0.09 0.06 0.18 0.19 0.03 0.12 0.06 0.00 0.08 
replicate 5 -0.17 0.22 0.17 0.31 0.16 0.08 0.06 0.17 -0.08 0.16 
replicate 11 -0.09 0.20 0.10 0.18 0.13 0.04 0.05 0.05 -0.11 0.11 
194882B Chr 3 0.03 0.43 0.30 0.49 0.19 0.12 0.07 0.16 0.13 0.30 
194884C Chr 7 -0.17 0.11 0.06 0.14 0.24 0.07 0.09 0.09 -0.08 0.09 
Isua supracrustal belt (ISB), 3.7 Gyr 

194907 D 1 -0.23 0.43 0.24 0.49 0.26 0.12 0.10 0.16 -0.20 0.30 
Mean IGC (n=14)!! -0.08 0.10 0.22 0.10 0.22 0.04 0.09 0.02 -0.03 0.06 
2 s.d. IGC" 0.33 0.35 0.14 0.08 0.20 
Seqi ultramafic complex, >3.0 Gyr 

186466 Chr 2 -0.44 0.43 -0.10 0.49 0.20 0.12 0.04 0.16 -0.29 0.30 
replicate 2 -0.14 0.43 0.60 0.49 0.13 0.12 0.04 0.16 -0.13 0.30 
186479 Chr 3 0.10 0.43 0.30 0.49 0.20. 0.12 0.08 0.16 -0.17 0.30 
Australia 

Pilbara craton, Warawoona Group, 3.5-3.2 Gyr 

Pil 16-61 Chr 3 -0.03 0.43 -0.11 0.49 0.05 0.12 0.01 0.16 -0.02 0.30 
replicate 3 -0.25 0.43 -0.17 0.49 0.05 0.12 -0.06 0.16 -005 0.30 
Canada 

Abitibi greenstone belt, 2.7 Gyr 

OKUM Kom 1 0.24 0.43 0.58 0.49 0.07. (0.12 -0.03 (0.16 0.08 0.30 


South Africa 
Bushveld igneous complex, 2.05 Gyr 


UG-2 Chr 20 =-0.12 0.08 -0.06 0.10 0.00 0.02 0.03 0.04 0.05 0.07 
replicate 29 --0.12 0.08 0.03 0.08 0.01 0.02 -0.01 0.03 -0.06 0.06 
replicate 19 -0.11 0.10 0.13 0.11 0.04 0.02 0.04 0.04 0.05 0.07 
replicate 11 -0.11 0.15 0.18 0.11 0.02 0.05 0.01 0.04 -0.02 0.10 
replicate 5 -0.08 0.47 -0.19 0.44 -0.02 0.12 -0.02 0.11 0.06 0.23 
replicate 2 -0.15 0.43 -0.34 049 -003 0.12 -0.03 0.16 0.08 0.30 
replicate 6 -0.31 0.25 0.13 0.31 -0.03 0.05 -0.02 0.07 0.08 0.10 
replicate 11 0.02 0.12 -0.01 0.16 0.03 0.03 -0.02 0.05 -0.04 0.09 
LG-6 Chr 4 -0.20 0.28 0.05 0.32 0.01 0.09 -0.01 0.09 -0.04 0.18 
Mean Bushveld (n=9)!! -0.13 0.07 -0.01 0.13 0.00 0.02 0.00 0.02 0.02 0.04 
2 s.d. Bushveld! 0.18 0.33 0.05 0.05 0.12 


*The term ‘replicate’ indicates that the Ru isotope data were obtained for a replicate digestion of the same sample powder. 

'H, harzburgite; D, dunite; Chr, chromitite; Kom, komatiite. 

‘Number of analyses of the same sample solution. 

SRuthenium isotope data are internally normalized to *°Ru/''Ru using the exponential law and are reported as € unit (0.01%) deviations from the terrestrial bracketing standard: 

Ru = (Ru/"'RUsampte /'Ru/""RUstandard — 1) X 104, where i= 96, 98, 100, 102 or 104. For samples measured n < 4 times, the quoted errors reflect the external uncertainty as defined by the standard 
deviation (2 s.d.) of replicate digestions and repeated analysis of a reference sample (see Methods). Uncertainties for samples measured n 2 4 times are given as 95% confidence intervals of the 
mean calculated as(s.d. x to 95, n-1)/J7- 

"Calculated averages for samples from IGC and Bushveld Complex with 95% confidence interval uncertainties calculated as (s.d. * to.95,n-1)//n and 2s.d. uncertainties. 

‘Elevated ©°Ru value due to an isobaric Zr interference that could not be corrected during the isotopic measurement (see Methods). Consequently, this value was not included for the 
calculation of the IGC mean value. 
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Extended Data Table 2 | Ruthenium concentration data, details about the NiS procedure and Re-Os isotope data for 
ultramafic rocks 


Ru NiS Age 
Sample _Rock* (ng g*)t n x (g)* 1870 s/188Os§ 187Re/188Os§ (Gyr)! 1879 s/188OSini.1 yOs* Ref.** 


Greenland 


Itsaq gneiss complex (IGC) 
Narssaq ultramafic body (NUB), 3.8 Gyr 


10_9 H 7.76 5 (8) x10g 0.1013 0.044 3.80 0.09845 -2.3 14 

10_11 D 8.27 8 x10g 0.1013 0.042 3.80 0.09857 -2.2 14 

South of the Isua supracrustal belt (SOISB), 3.8 Gyr 

10_27 H 3.13-6.80 5(6)x10g 0.1010 - 0.1153 0.023 - 0.233 3.81 0.09942 - 0.10002 -5.3 to +5.8 14 

Ujaragssuit Nunat complex, 3.8 Gyr 

194856 Chr 215 1 x5g 0.10471 - 0.10493 0.0008 - 0.0013 3.81 0.1046 - 0.1047 +3.9 to +4.0 33 

194857. Chr 189 1 x5g 0.10490 - 0.10809 0.01424 - 0.02906 3.90 0.10395-0.10614 +3.9to +6.1 38 

194882B Chr 12 6x10g 0.104842 - 0.105590 39 

194884C Chr 23 4x10g 

Isua supracrustal belt (ISB), 3.7 Gyr 

194907 D 2.43 - 7.06 7 x10g 0.10899 - 0.37064 0.049 - 2.96 3.80 0.1045 - 0.1772 +3.8 to +76.2 23 
2.9-5.3 0.10245 - 0.13411 0.042 - 0.707 3.70 0.089 - 0.101 -12.8 to +0.2 46 

Seqi ultramafic complex, >3.0 Gyr 

186466 Chr 53.83 2 x5g 42 

186479 — Chr 53.54 2 x5g 42 

Australia 

Pilbara craton, Warawoona Group, 3.5-3.2 Gyr 

Pil 16-61 Chr 115 1 x5g 0.10517 - 0.11926 0.0281 - 0.1688 3.46 0.0979 - 0.1130 -5.2 to +9.6 33 

Canada 

Abitibi greenstone belt, 2.7 Gyr 

OKUM Kom 4.25 4 x10g 0.269 44, 54 


South Africa 
Bushveld igneous complex, 2.05 Gyr 


UG-2 Chr 760 1 x5g 0.1483 - 0.1489 0.0916 45, 55 
LG-6 Chr 270 - 430 1 x5g 0.1254 - 0.1265 0.101 - 0.117 45, 55 


*H, harzburgite; D, dunite; Chr, chromitite; Kom, komatiite. 

‘Ruthenium concentrations reported for the same samples or samples from the same locality. Italicized values indicate Ru concentrations determined in this study as described in the Methods. 
‘Number of individual NiS digestions (n) and amount of sample material (in g) taken for each single NiS digestion processed for each sample and replicate sample analysis. In cases where 
different numbers of NiS digestions were processed for a replicate sample analysis, this is indicated by the number in parentheses. If not stated otherwise, the same number of NiS digestions 
was used for all replicates. 

5Re-Os isotope data reported for the same samples or for samples of similar composition from the same locality. Note that the different ranges reported for chromitite samples from the 
Ujaragssuit Nunat complex were not obtained for the same samples and represent data reported by different previous studies for chromitites from the same locality or sampling area. 

Age used for calculation of the initial '*’Os/"®*Os isotope composition as quoted in literature. 

Mnitial '’Os/*Os isotope composition calculated at the time of emplacement of the samples. 

*yOs values refer to the relative deviation (in %) of the calculated initial *’Os/°Os isotope compositions of samples relative to the chondritic Os isotope composition at the time of emplacement 
of the samples. 

**Data from refs. '423539894244-4654.59 ag indicated. 


Extended Data Table 3 | Parameters for mixing model shown in Fig. 3 and resulting ¢'°°Ru values for pre-late-veneer mantle 
endmember composition 


€'Ruinpre- —€!°Ru in pre- €'0Ru in pre- 


100 -1 
arn 7 Ru (ng 9°) LV mantle LV mantle min LV mantle max 


Chondrite groups 


Enstatite -0.08 0.04 81856 0.31 0.16 0.47 
Ordinary -0.29 0.03 88256 1.14 1.02 1.26 
Carbonaceous average -0.90 0.61 83856 3.56 1.19 4.74 
Carbonaceous average -0.90 0.12 83856 3.56 3.08 4.03 
Cl -0.24 0.13 637.456 1.00 0.46 1.55 
CM -0.69 0.38 81756 2.90 1.30 4.49 


Terrestrial mantle 
Modern mantle* 0.00 0.02 7.016 
Eoarchean SW Greenland mantlet 0.22 0.04 


Pre-late veneer mantle 1.4 


*The composition of the modern mantle corresponds to the mean value calculated for samples from the Bushveld complex (Extended Data Table 1). 
‘The Eoarchaean southwest Greenland mantle composition represents the mean value calculated for samples from the IGC (Extended Data Table 1). Data from refs. '©°°. 
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Skeletal inclusions in approximately 99-million-year-old amber from northern 
Myanmar provide unprecedented insights into the soft tissue and skeletal anatomy 
of minute fauna, which are not typically preserved in other depositional 


environments’ *. Among a diversity of vertebrates, seven specimens that preserve the 
skeletal remains of enantiornithine birds have previously been described’* ~, all of 
which (including at least one seemingly mature specimen) are smaller than specimens 
recovered from lithic materials. Here we describe an exceptionally well-preserved 
and diminutive bird-like skull that documents anew species, which we name 
Oculudentavis khaungraae gen. et sp. nov. The find appears to represent the smallest 
known dinosaur of the Mesozoic era, rivalling the bee hummingbird (Mellisuga 
helenae)—the smallest living bird—in size. The O. khaungraae specimen preserves 
features that hint at miniaturization constraints, including a unique pattern of cranial 
fusion and an autapomorphic ocular morphology’ that resembles the eyes of lizards. 
The conically arranged scleral ossicles define a small pupil, indicative of diurnal 
activity. Miniaturization most commonly arises in isolated environments, and the 
diminutive size of Oculudentavis is therefore consistent with previous suggestions 
that this amber formed onan island within the Trans-Tethyan arc’. The size and 
morphology of this species suggest a previously unknown bauplan, and a previously 
undetected ecology. This discovery highlights the potential of amber deposits to 
reveal the lowest limits of vertebrate body size. 


Aves, Linnaeus 1758 
Oculudentavis khaungraae gen. et sp. nov. (Fig. 1) 


Holotype. Hupoge Amber Museum (HPG)-15-3, a complete skull pre- 
served in amber; the block measures 31.5 mm x 19.5 mm x 8.5 mm and 
weighs 2.84 g. 

Etymology. The generic name Oculudentavisis derived fromthe Latin 
oculus (eye), dentes (teeth) and avis (bird). The species name khaun- 
graae is from Khaung Ra, who donated the specimen to the Hupoge 
Amber Museum. 

Locality and horizon. Cenomanianage, 98.8 + 0.6 million years ago". 
Angbamo site, Tanai township (Myitkyina district, Hukawng valley, 
Kachin province), northern Myanmar. 

Diagnosis. Very small bird with the following autapomorphies: jugal 
process of maxilla reaches caudally to the level of mid-orbit; jugal bar 
cross-section strongly angled dorsolaterally-ventromedially; triangu- 
lar, dorsolaterally oriented coronoid process on mandible; and 23 teeth 
in the upper jaw, 4 of which are located beneath the orbit. The taxon 
can be further diagnosed by the unique combination of the following 


features: slender rostrum; antorbital fenestra reduced or absent; and 
scleral ossicles spoon-shaped, longer (from external to internal margin) 
than they are wide (distance between adjacent ossicles). Although 
the medial articulations between the premaxillae and dentaries are 
discernible, high-resolution synchrotron computed tomography scans 
reveal that the bones are partially fused. We consider HPG-15-3 to be 
skeletally mature or nearly so (Supplementary Information), and con- 
sider its small size and an unusual pattern of cranial fusion (with the 
premaxilla, maxillaand nasal fused intoa single unit) to be diagnostic 
features of O. khaungraae. 


Description 
HPG-15-3 is mesorostrine (Fig. 1, Extended Data Fig. 1). The jugals are 
bowed laterally and the skull width is greatest at the caudal margin of 
the orbit. The upper jaw is wider than the lower jaw, the dentition of 
which occludes with the ventral surface of the secondary palate. 

The cranial half of the rostrum is imperforate. Nutrient foramina 
are present on the lateral surface of the premaxillae, maxillae and 
dentaries. The premaxillae are partially fused medially, such that the 
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Fig. 1| Photograph, computed tomography scans and interpretive drawings 
of the HPG-15-3 holotype of O. khaungraae.a, Photograph of the amber piece 
with skull ventrolaterally exposed. b,c, Scan (b) and drawing (c), left lateral view. 
d,e, Scan (d) and drawing (e), rostral view. f, g, Scan (f) and drawing (g), occipital 


right premaxilla canbe clearly distinguished from the leftin rostral or 
dorsal view. Articulation with the maxillae cannot be discerned, which 
suggests that these elements were fused to one another. The frontal 
processes of the premaxillae are elongate, and apparently articulate 
deeply with the frontals. The external nares have rounded rostral mar- 
gins and tapered caudal margins. 

The maxilla is long and toothed along its entire length. Caudal to the 
nares, the maxilla expands dorsolaterally until it reaches the rostral mar- 
gin of the orbit, forming alongand broad nasal process. The jugal ramus 
of the maxilla extends beyond the rostral margin of the orbit, articulating 
with the ventral surface of the jugal (Extended Data Fig. 2). Medially, the 
jugal ramus of the maxilla contacts the pterygoid—whether thisis through 
anectopterygoid or not is unclear—and ends just caudal to this contact. 
The lacrimal has well-developed dorsomedial and ventral processes, and 
avery small dorsolateral process. The lacrimal is roughly trapezoidal in 
dorsal aspect; the rostral margin is wedged between the nasal medially, 
and the maxilla laterally (Fig. 1c). There is no clear evidence of an antorbital 
fenestra. The jugal and maxilla have a long, dorsoventrally overlapping 
articulation, such that the jugal forms the entirety of the concave, ven- 
tral margin of the orbit’. The jugal is strongly compressed, bowed and 
transversally inclined such that its cross-section is angled dorsolater- 
ally-medioventrally. The caudal end of the jugal tapers dorsocaudally, 
articulating with the jugal process of the postorbital. 

The large, Y-shaped postorbital forms a complete infratemporal 
bar and demarcates the infratemporal and supratemporal fenestrae 
(Fig. 1). The quadrate is angled dorsocaudally—rostroventrally. The 
otic process forms a single articular surface (Extended Data Fig. 2c, d). 
The triangular orbital ramus is short and broad, articulating with the 
pterygoid medially. 
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view. h, i, Scan (h) and drawing (i), dorsal view. de, dentary; fr, frontal; hy, hyoid 
bone (or bones); jg, jugal; la, lacrimal; mx, maxilla; pa, parietal; pm, premaxilla; 
po, postorbital; qd, quadrate; sc, scleral ossicle; so, supraoccipital; sq, 
squamosal; th, teeth. Scale bars, 5 mm; longer scale bar belowbapplies to b-i. 


The orbits are large and entirely occupied by massive scleral rings; 
their internal margins are medially displaced (that is, the internal mar- 
gin is medial to the external margin, opposite to the in vivo condition 
in neornithines). The ring consists of 14 scleral ossicles (the modal 
number in saurians). The ossicles are large, and their length is twice 
that of their greatest width. The internal half ofeach ossicle is rounded 
andtheexternal half is strap-like, expanded in width externally. As pre- 
served, the internal diameter of the scleral ring measures slightly less 
than the long axis of the ossicles themselves. An interorbital septum 
is not visible and the mesethmoid appears to be absent. 

The frontals are long, slender and medially fused (Fig. 1). The parietals 
are unfused to the frontals, which is typical of nonornithurine birds 
(Extended Data Fig. 3d). Medially, the parietals are fused only along the 
rostral half of their contact. The parietals are large, longer than they are 
wide and domed. Their caudal margins form a delicate nuchal crest. 
The triradiate squamosal is unfused to the braincase. The occipital 
table is poorly preserved, owing to the activity of bivalves? (Extended 
Data Fig. 4g). The broad supraoccipital forms the dorsal margin of the 
large caudally opened foramen magnum (Fig. If, g). Short paraoccipital 
processes project sideways at roughly the level of the dorsal margin 
of the foramen magnum. Only the dorsal surface of the brain is clear 
in the endocast, and reveals well-developed cerebral hemispheres 
that—in dorsal view—are as wide as they are long (Extended Data Fig. 4). 

The parasphenoidal lamina is large and flat (Extended Data Fig. 2a). 
The palatal configuration resembles that of palaeognaths in having 
rostrolaterally projecting basipterygoid processes. The ptergyoids are 
cranially forked into ashort maxillary process anda longer palatine pro- 
cess. The tapered parasphenoidal rostrum extends rostrally just past 
the midpoint of the orbit. The rostral half of the palate is imperforate, 
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Fig. 2 | Proportions of the eye socket relative to the skull in HPG-15-3, 
compared to extant birds. The plot of log,)-transformed orbit length (y axis) 
versus log,)-transformed skull length for extant bird species (grey triangles) 
(n=206), with the phylogenetic generalized least squares line obtained for the 
majority-rule consensus tree for the Hackett backbone tree set (see ‘Scaling of 
eye socket and skull length’ sections inthe Methods and Supplementary 
Information for details). The skull of HPG-15-3 (red square) is smaller than any of 
the extant birds that we included, but the eye socket is about as large as 
expected ifthe fitted line is extended towards HPG-15-3. The skull drawings of 
M. minima (purple square), O. khaungraae (red square) and Gallus gallus (blue 
square) are depicted toa relative scale (scale bar equals 10 mm). The yellow and 
black squares plot the ostrich (Struthio camelus) and the greater rhea (Rhea 
americana), respectively. The silhouettes depict the body size of Oculudentavis 
relative to Mellisuga, Struthio and Gallus (colours of the silhouettes correspond 
totheir respective data points). 


formed by the medially unfused palatal processes of the premaxillae 
(Extended Data Fig. 3c). The choanae appear to be level with the exter- 
nal nares. The preserved oral soft tissue bears a pattern of elongated 
papillae’ (Extended Data Fig. 2a, b). 

The lower jaws are thin and nearly straight, with subparallel dorsal 
and ventral margins (Extended Data Fig. 5). The medial suture between 
dentariesis clearly visible; no predentary bone is present. The bounda- 
ries between postdentary bones cannot be discerned. A triangular, 
dorsolaterally oriented coronoid process is present at one quarter 
length from the caudal margin of the lower jaw. The bowed (that is, 
laterally concave) ceratobranchial elements are preserved with their 
cranial ends in contact with one another (Fig. le). 

The upper jaw contains 23 teeth on the right side; 18 teeth are pre- 
served on the left side. The rostralmost three teeth are somewhat 
procumbent. The apices of the rostralmost seven teeth are weakly 
curved lingually. The caudal teeth are conical and ventrally oriented. 
The mesial and distal margins of the teeth are gently carinated. The 
teeth lack basal constrictions and have expanded roots; they vary in 
size, crown height and spacing. The teeth are largest in the portion 
of the maxilla that is ventral to the external nares, and are smallest 
underneath the orbit. Each dentary had approximately 29 or 30 teeth, 
which resemble those in the upper jaw. The tooth geometry appears 
to be acrodont to pleurodont; no grooves or sockets are discernable. 


Discussion 

The skull of O. khaungraae reveals a general avian morphology, includ- 
ing a slender, tapering rostrum with retracted naris, an enlarged and 
well-defined eye socket, a short compressed postorbital region anda 


dome-shaped skull roof. O. knaungraae appears to represent the small- 
est known Mesozoic dinosaur, rivalling the extant bee hummingbird— 
the smallest known dinosaur of all time—in size (Fig. 2). The length of 
the postrostral portion of the skull (measured fromthe contact between 
the nasal and frontal at the sagittal line to the caudal end of the cranium) 
is 7.1mm, compared to 8.8 mm inthe vervain hummingbird (Mellisuga 
minima)—whichis only slightly larger than the bee hummingbird». No 
other group of living birds features species with similarly small crania 
in adults (Fig. 2, Extended Data Fig. 6). The discovery of Oculudenativs 
highlights the presence of diminutive members of vertebrate faunas 
that possibly can only be studied through preservation in amber, which 
represents a taphonomic filter capturing the lowest end of the body-size 
spectrum. This discovery is also consistent with hypotheses that the 
Cretaceous Burmese amber from the Angbamo site formed in anisland 
arc, as miniaturization mostcommonly arises inisland environments!°’, 

HPG-15-3 displays morphologies that, to our knowledge, have not 
previously been observed in any bird; these morphologies include fea- 
tures that depart fromthe theropod condition altogether (for example, 
spoon-shaped scleral ossicles and acrodontto pleurodont dentition), as 
well as an unusual combination of traits that are primitive (for example, 
complete postorbital bar and extensive tooth row) and advanced (for 
example, an expanded imperforate portion of the rostrum andan orbit 
confluent with antorbital fenestra) for avians. Nearly all of these unu- 
sual morphologies can be interpreted as the effects of miniaturization, 
which is commonly associated with the reappearance of plesiomorphic 
morphologies, increased bone fusion and proportionally enlarged sen- 
sory organs (for example, the eye and middle ear)**”, Miniaturization 
is most commonly associated with paedomorphism that results from 
progenesis®'®, However, the skull of Oculudentavis reveals no obvious 
paedomorphic features (for example, a proportionally large orbit and 
short rostrum), which suggests that miniaturization in this lineage may 
have been achieved by areduction in growth rate’. 

The degree of fusion between the skull bones that form each half of 
the rostrum and the patternin which the sutures have closed is highly 
unusual among nonneornithinetheropods. In early birds, the premaxil- 
lary bodies—followed by the dentaries—are consistently the first skull 
elements to fuse in both an ontogenetic and phylogenetic context, and 
these sutures are completely obliterated in mature specimens of some 
taxa (for example, Confuciusornithiformes and the enantiornithine 
Gobipteryx)”°. In Late Cretaceous ornithurines the mandibular bones 
remain unfused, although the premaxillae form a single element”. In 
Gobipteryx and neornithines, the premaxillae fuse into a single element 
during embryonic development”. The pattern of fusion in HPG-15-3 
departs from this pattern and is distinct from all other known thero- 
pods, highlighting the enigmatic nature of Oculudentavis. This fusion 
pattern may be related to structural constraints that are imparted by 
miniaturization’ and tothe predatory ecology inferred for this animal. 

Thescleral ring of Oculudentavisis very different from that preserved 
in any known Mesozoic dinosaur, in which the ossicles are typically 
nearly square-to-rectangular, narrow and demarcate a proportionally 
larger aperture (as seen, for example, in Archaeopteryx, Sapeornis and 
Yixianornis). In HPG-15-3, the scleral ring is very large and is formed 
by elongated spoon-shaped ossicles; a morphology similar to this is 
otherwise known only in lizards (for example, Lacerta viridis)”. The 
relatively small aperture defined by the scleral ring of Oculudentavis 
suggests unique visual capabilities compared to other Mesozoic thero- 
pods (Fig. 3), and indicates that the maximum size of the pupil was fairly 
small, limiting the amount of light entering the eye**. Even account- 
ing for a range of values, this morphology suggests a diurnal lifestyle 
in photopic light environments” (Fig. 3). Although both scleral rings 
appeared deformed, the elongation of the ossicles—together with the 
bowed morphology of the jugal bar—suggests that these elements would 
have defined a conical eye similar to that of some extant birds (Extended 
Data Figs. 7, 8) and lizards”. Nanoid taxa typically have enlarged eyes?” 
that result from negatively allometric scaling of the eyes to body size. 
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Fig. 3 | Astar plot illustrating the functional morphospace ofscleral-ring 
and orbit morphology of modern saurians in the context of their diel 
activity pattern. The xaxis (labelled ‘geomm’) represents the log,o- 
transformed geometric mean of orbit length and the external and internal 
scleral-ring diameters, and the yaxis (labelled ‘opt’) is formed by the optical 
ratio. Triangles indicate squamates (n= 60 species); circles indicate birds 
(n=104 species). Ellipses represent 95% confidence areas for cathemeral 
(green), diurnal (orange) and nocturnal (purple) groups, drawn around the 
respective average x and y values for each group. Density plots illustrate the 
distribution of data along thexand yaxes, delineated by groups of diel activity 
pattern. The retrodeformed eyes of HPG-15-3 are most similar in shape and size 
tothe eyes of diurnal squamates. The black oval around the data points 
representing the left and right eyes of HPG-15-3 represent the 95% confidence 
ellipse, indicating the range of values estimated by our sensitivity analysis. The 
average of the geometric mean of all eye variables and the optical ratio for the 
retrodeformed left and right eyes were calculated and used to generatea 
distribution of possible combinations spanning +30% of the left- and right-eye 
averages. 


Oculudentavis follows this pattern and falls where expected ina linear 
regression of avian orbit size relative to skull length (Fig. 2, Extended 
Data Fig. 9). The laterally bulging eyes of Oculudentavis, as indicated by 
the angled morphology of the jugal bar, probably represent an alterna- 
tive strategy for increasing eye size without further increasing the size 
of the orbit. Notably, Oculudentavis lacks the forward-facing position 
of the eyes of owls and living birds of prey; the eyes are laterally ori- 
ented such that binocular vision was absent, or largely limited. Given 
the unusual morphologies observed in HPG-15-3, extant analogues for 
the visual abilities of Oculudentavis may not exist. 

Although miniaturization is often associated with edentulism’, 
Oculudentavis possesses an extensive dentition with unusual features. 
First, the acrodont to pleurodont geometry of tooth implantation 
differs from the thecodont condition in all other dinosaurs. Second, 
the tooth rowextends ventral to the orbit; this conditionis otherwise 
only observed among coelurosaur theropods in the Late Cretaceous 
ornithurine bird /chthyornis””®. In most coelurosaurs (including Aves), 
the maxillary tooth rowtypically extends as far caudally as the rostral 
half of the antorbital fenestra (well cranial to the orbit’), although the 
tooth row extends nearly to the jugal-lacrimal contact in sometroodon- 
tids”’ and in /chthyornis one toothis located caudal to the lacrimal and 
beneath the orbit. The tooth row is even more caudally extensive in 
Oculudentavis, which has four maxillary teeth underlying the orbit. 
HPG-15-3 has more teeth than any other known Mesozoic bird, which 
highlights both the importance of dentition in the feeding strategies 
of Cretaceous avians and the existence of strong positive selection for 
teeth despite the overall avian trend towards edentulism”. Increased 
dentition in Cretaceous birds has previously been correlated with pis- 
civory”’. However, the papillae preserved on the tongue and ventral 
surface of the upper jaw of Oculudentavis (Extended Data Fig. 2a, b) 
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Fig. 4| Simplified results of the strict consensus of 2,044 trees depicting the 
phylogenetic relationships of O. khaungraae relative to other known 
Mesozoic birds. The analysis (consisting of 36 taxascored across 

257 characters) wasruninthe TNT program using implied weighting (k=16). 
Consistency index= 0.47; retention index = 0.645. Inthe analysis using a priori 
weights, Oculudentaviswas resolved as an enantiornithine ina small percentage 
of the most-parsimonious trees. For most species for which skull material is 
known, the skull is reconstructed to scale relative to O. khaungraae. Scale bar, 
2cm. Names of species for which no skull material is known are written in grey. 


lack the long, sharp and caudally oriented morphology thatis present in 
many extant piscivorous birds“. Although unexpected for its small size, 
thelarge number of teeth—as wellas their sharp and carinated morphol- 
ogy—suggests that Oculudentavis was a predator, probably feeding on 
small arthropods and otherinvertebrates (which are abundantin amber 
inclusions from the Angbamosite). The evolution of island nanism has 
beenlinked to a previously unrecognized ecological guild specialized 
in feeding on small invertebrates in Paedophryne frogs (a group that 
includes the smallest known vertebrate in the world’°), and a similar 
diet is possible for Oculudentavis. 

Oculudentavis retains the plesiomorphic diapsid condition thatis pre- 
sent in most nonornithothoracine birds, whereas the infratemporal bar 
was independently lostamongenantiornithinesand ornithuromorphs”. 
Retention of the postorbital bar suggests limited cranial kinesis”, consist- 
ent with the absence ofa distinct craniofacial hinge and with data that sug- 
gestavian cranial kinesis is mostly amorphological novelty of neognaths*. 


The largely akinetic skull morphology of Oculudentavis suggests a bite 
forcethat isrelatively stronger thanthat of aneornithine of the same size”, 
whichis supported by the unusual degree of cranial fusion”, the presence 
of a well-developed coronoid process and the absence of an antorbital 
fenestra. A distinct coronoid processis not developed in any other Meso- 
zoic bird but it is well-developed in predatory birds of the Cenozoic era, 
suchas phorusrhacids*. Inarchosaurs, this process provides attachment 
for adductor muscles®. Reductionin pneumatic features may suggest that, 
inthe Oculudentavis lineage, solid bone was favoured biomechanically 
over opportunistic pneumatization®. This reinforcement of the skull may 
reflect the predatory lifestyle inferred for Oculudentavis. Alternatively, the 
loss of the antorbital fenestra may be a consequence of miniaturization, 
and provide additional space for the enlarged orbit’. 

HPG-15-3 preserves no features that unambiguously indicate its phy- 
logenetic position, although the sum of the observed morphologies 
suggests that the taxon falls outside Ornithuromorpha. The extensive 
tooth row resembles noncoelurosaur theropods” and the ornithurine 
Ichthyornis”, whereas the presence of premaxillary teeth (which are 
absent in the ornithurines Hesperornis and Ichthyornis), complete 
infratemporal and supratemporal fenestrae (which are typically absent 
in Ornithothoraces)”, the quadrate with a broad orbital process (which 
is typically narrow in Ornithuromorpha) and the absence of a preden- 
tary (which is present in toothed Cretaceous ornithuromorphs)* all 
suggest a more-basal position. However, nonornithothoracine avian 
lineages (for example, Jeholornis and Sapeornis) are typically much 
larger than Cretaceous ornithothoracines— particularly the enantio- 
rnithines. Miniaturization is strongly associated with extreme homo- 
plasy, which obfuscates phylogeny’. Our cladistic analysis using a priori 
weights produces a poorly resolved consensus tree in which all birds 
more derived than Archaeopteryx form a polytomy (Extended Data 
Fig. 10a). Approximately 90% of all equal-length trees resolve Oculuden- 
tavis as more derived than Archaeopteryx but basal to Jeholornis; the 
remaining 10% place it within Enantiornithes. Using implied weights 
(which is recommended for taxa characterized by strong homoplasy, 
including Aves’ “’), our analysis consistently resolves Oculudentavis as 
intermediate between Archaeopteryx and Jeholornis (Fig. 4, Extended 
Data Fig. 10b, Supplementary Information). If this hypothesis were to be 
supported by additional postcranial data, HPG-15-3 could possibly rep- 
resent the first nonenantiornithine bird in the avifauna in the Hukawng 
valley deposits. Given the unusual morphology of Oculudentavis (which 
is clearly unlike that of any other bird), there is a strong potential for 
new data to markedly alter our systematic conclusion. Regardless, this 
discovery highlights the capacity of amber to document vertebrate 
diversity that is unrecorded by other depositional environments. 
Note added in proof:Since this paper was accepted, an eight specimen 
that preserves the skeletal remains of an enantiornithine bird has been 
described”. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Material and photography 

HPG-15-3 is housed and displayed in the Hupoge Amber Museum; the 3D 
printing models are available through the Dexu Institute of Palaeontol- 
ogy. HPG-15-3 was examined with a VH-Z100UT Universal Zoom Lens 
(100x to 1,000x) in an optical microscope (KEYENCE VHX-6000) with 
100x magnification at Shenyang Normal University. Photographs were 
taken using a Canon digital camera (SD Mark III, MP-E 65MM F/2.8 1-5X) 
fitted toa macro rail (Cognisys) and processed using Helicon Focus 5.1 
and Adobe Photoshop CSS software. 


Scanning and 3D reconstruction 

HPG-15-3 was imaged nondestructively using propagation phase-con- 
trast synchrotron radiation X-ray microtomography, on beamline 13W 
of the Shanghai Synchrotron Radiation Facility. The SR beam was mono- 
chromatized at 22 keV using the double Si (111) crystal monochromator. 
The distance between sample and detector (propagation distance) 
was 60 mm to obtain the phase contrast. The physical pixel size of 
the charge-coupled device sensor was 6.5 um x 6.5 um, and we used a 
2x microscope objective; the isotropic voxel size was 3.25 um. The 
pixel number of our detector was 2,048 x 2,048 and its dynamic range 
was 16 bit. The field of the view of our detector was 6.6 mm x 6.6 mm. 

Six microcomputed tomography scans under the same conditions 
were performed to image the whole specimen, owing to the fact its 
size in the vertical direction is much larger than the field of view of 
our detector. The off-axis scanning 360° mode was adopted to 3 scans 
because the size of the sample in the horizontal direction was larger 
than the field of view of the detector. In this mode, 6,000 projections 
were collected fora single computed tomography scan. The exposure 
time ofa single projection was 0.3s. 

In the preprocessing of the off-axis scanning data, image stitch- 
ing was the first step performed, considering the image alignment, 
the contrast unification and the tilt of the rotation axis. We obtained 
3,000 stitched images with enlarged horizontal fields of view (about 
6.6 mm x 13 mm) for 3,000 project angles. The phase retrieval and 
slice reconstruction of all the six projection datasets were performed 
using PITRE-3 software’. After the reconstruction, the computed- 
tomography slice sets for the six segments were stitched again in the 
vertical direction. 

The computed tomography slices for the entire sample comprised 
36 GB of data. To reduce the amount of memory used, the sampling 
interval was set as one pixel in all three directions, so that the data 
were reduced to one-eighth of their original size. The amber and all 


impurities surrounding the skull were removed using the image seg- 
mentation function. The rotation correction was performed using 
ImageJ software. The 3D data processing, segmentation and analysis 
were performed using VG StudioMax 1.2 and 2.1. The 3D morphology 
of the endocast was obtained through manual segmentation using 
Image] 1.4 and rendered using VG studio Max 2.4. 


Scaling of eye socket and skull length 

We assessed the allometry of the avian eye socket with phylogenetic 
generalized least squares“. We used a Brownian correlation matrix and 
maximum likelihood estimates ofA to summarize the slope estimates 
over 1,000 previously published time-calibrated trees*>°. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Owing to their size, the raw computed tomography data are available 
upon request fromL.X. (xinglida@gmail.com). All other materials are 
includedinthe Supplementary Information or are available at https:// 
doi.org/10.5281/zenodo.3591994. 


43. Chen, R.-C. et al. PITRE: software for phase-sensitive X-ray image processing and 
tomography reconstruction J. Synchrotron Radiat. 19, 836-845 (2012). 

44. Symonds, M.R.E. & Blomberg, S. P. in Modern Phylogenetic Comparative Methods and 
their Application in Evolutionary Biology (ed. Garamszegi, L. Z.) 105-130 (Springer, 2014). 

45. Jetz, W., Thomas, G. H., Joy, J. B., Hartmann, K. & Mooers, A. O. The global diversity of 
birds in space and time. Nature 491, 309-316 (2012). 

46. Jetz, W. et al. Distribution and conservation of global evolutionary distinctness in birds. 
Curr. Biol. 24, 919-930 (2014). 


Acknowledgements This research was funded by the National Natural Science Foundation of 
China (no. 41888101, 41790455 and 41772008), the National Geographic Society (no. ECO768-15) 
and the Natural Sciences and Engineering Research Council of Canada (2015-00681). We 
thank BL13W of the Shanghai Synchrotron Radiation Facility for beamtime access based on 
proposal 16ssrf 01737, and the Beijing Synchrotron Radiation Facility for supplying the high 
MTF imaging detector. We thank S. Abramowicz for assistance with figures and D. Blackburn, 
D. Steadman and E. Stanley for making the computed tomography scan of M. minima 
accessible. 


Author contributions L.X. and J.K.O. designed the project, L.X., J.K.O., L.M.C., L.S., R.C.M., QY. 
and G.L. performed the research: G.L. and QY. performed computed tomography scanning of 
the specimen and processed the data. L.S. performed the eye-scaling statistical analyses. 
J.K.O. performed the cladistic analysis. J.K.O., L.M.C., L.S., L.X. and G.L. wrote the manuscript. 
L.X., J.K.O., L.S. and G.L. contributed equally. 


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at https://doi.org/10.1038/s41586-020- 
2068-4. 

Correspondence and requests for materials should be addressed to J.K.O. 

Reprints and permissions information is available at http://www.nature.com/reprints. 


Extended Data Fig. 1| Close-up photographs of HPG-15-3.a, Entireskullinleft | original position of skull, which drifted before the resin hardened; the black 
lateral view. b, Left eye. c, Maxillary dentition. The black arrows in aindicate arrow in bindicates the position of decay products released from the left eye. 
decay products from the soft tissue of the dorsal surface of the skull and the Scale bars, 2mm (a), 500 pm (b), 200 tm (c). 
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Extended Data Fig. 2| Computed tomography scan of HPG-15-3 in palatal square box ina indicates the region enlarged in b. bp, basipterygoid process; 
view, with the mandibles removed, and an isolated quadrate. a, Full palatal 


bs, basisphenoid plate; bsr, basisphenoid rostrum; ch, choana; dt, developing 
view. b, Close-up of the preserved lingual papillae of the roof of the mouth. 


tooth; pt, pterygoid; pp, papillae; pmc, medial contact of the palatal processes 
c, Isolated left quadrate in lateral view. d, Quadrate in caudal view. Dashed of the premaxillae. 


Extended Data Fig. 3 |Raw computed tomography slices showing the caudal portion (open). f, Image of the entire skull, showing position of the slices 
anatomy of important cranial sutures of HPG-15-3. a, Interparietal suture, shown ina-c, e (image shown in disa sagittal slice through the middle of the 
cranial portion (closed). b, Interfrontal suture (closed).c, Palatal processes of skull). Boxes outlined in dashed pink lines show the region enlarged inthe 

the premaxilla (open). d, Frontoparietal suture (open).e, Interparietal suture, insets, to clearly demonstrate the morphology of the suture or contact. 
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Extended Data Fig. 4| Rendering of the cranial endocast of HPG-15-3. endocast reveals reliable information. The white dashed lines indicate the 

a, Dorsal view. b, Ventral view. c, Caudal view. d, Cranial view. e, Right lateral portions of the endocast that probably were not occupied by brain tissue. The 
view. f, Left lateral view. g, Interior view of the brain cavity, showing the bivalve cerebrum appears to be prominent but a distinct optic lobe—as seen in other 
boring that intrudes through the ventral surface. Because the ventralsurfaceof —birds—cannot be identified. bb, bivalve boring; c, cerebrum; ot, part of the 
the cranium is damaged by a bivalve boring and the bones supporting the olfactory tract and/or olfactory lobe. 

cranial margins of the brain are not preserved, only the dorsal surface of the 


Extended Data Fig. 5 | Isolated mandible of HPG-15-3. a, Right mandible in mandibles in ventral (left) and dorsal (right) views. cor, coronoid process; mds, 
(from top to bottom) ventral, dorsal, medial and lateral views. b, Left mandible mandibular symphysis; mf?, possible mandibular foramen; nf, nutrient 
in (from top to bottom) ventral, dorsal, medial and lateral views. c, Articulated foramina. 
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HPG-15-3 hummingbirds passerines swifts other 
Birds 
Extended Data Fig. 6 | Skull size in HPG-15-3 compared to other birds. The illustrates the median (thick line) the 1st and 3rd quartiles (the hinges), andthe 
skull of HPG-15-3 is small compared to those of extant birds (total ofn=213 distance from the upper and lower hinge tothe largest and smallest valueno 
extant bird species sampled), here illustrated through a box plot of log,o- further than1.5x theinterquartile range (the whiskers). HPG-15-3 is smaller than 
transformed postnasal skull length (as a proxy for the braincase), measured swifts (n=12), passerines (n = 23) and hummingbirds (n=22)—and may even be 


from the craniofacial hinge to the caudal end of the cranium. Each box plot smaller than the smallest hummingbird, M. helenae. 


Extended Data Fig. 7 | Scleral ring of HPG-15-3.a, As preserved. b, After retrodeformation. 
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Extended Data Fig. 8 | Scleral rings of selected neornithines. a, Asio e, Chordeiles minor, common nighthawk. f, Selaphorus sasin, Allen’s 
flammeus, short-eared owl. b, Buteojamaicensis, red-tailed hawk.c, Cerorhinca hummingbird. g, Cypseloides niger, American black swift. h, Megaceryle alcyon, 
monocerata, rhinoceros auklet.d, Dendrocopus villosus, hairy woodpecker. belted kingfisher. 
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Extended Data Fig. 9 | Summary of the slope estimates obtained from 
phylogenetic generalized least squares withthe Ericson and the Hackett 
backbonetree sets, containing 1,000 trees each. a, Ericson backbone tree set. 
b, Hackett backbone tree set. Thex axes represent the estimated slope values, 
and the yaxes represent the number of the tree sampled from the entire set of 
1,000trees. Dots signify the actual slope estimate, and grey bars visualize the 
s.e. of the slope estimates. The blue dashed line is the slope value obtained from 
ordinary least square regression, and the red dashed line represents the slope 
of isometry. Results from the phylogenetic generalized least squares iterations 
(n=1,000) suggest the presence of negative allometry (slope <1) inthe relation 
between eye socket and skull length (P< 0.001). 
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Dromaeosauridae Dromaeosauridas 
| Archaeopteryx : (> Archaeopteryx 
Neuquenornis Oculudentavis 
Halimornis Jeholornis prima 
Enantiophoenix Eoconfuciusernis 
Shenqiornis Lf Changchengornis 
Protopteryx Confuciusornis sanctus 


[—— Sapeornis 
Archaeorhynchus 
Patagopteryx 


Iberomesornis 
Gobipteryx_N_ valifanovi 
Eoenantiornis 


; Longicrusavis 
Eocathayornis [= 4 Yanornis 

Eoalulavis Gansus_yumenensis 
Elsornis (—— Apsaravis 
Concornis Ichthyornis 
Cathyaornis_yandica + Hesperornis 
Oculudentavis Gallus 
Sapeornis Anas 


MTT 


Jeholornis prima 


— Pengornis 


Protopteryx 
Iberomesornis 


Eopengornis Shanweiniao 
Eoconfuciusornis Longipteryx 
Changchengornis Rapaxavis 


Confuciusornis sanctus Longirostravis 


Shanweiniao Gobipteryx N_valifanovi 
Longipteryx Pengornis” 
Rapaxavis Eopengornis 


Elsornis 

Eosenantiornis 
Neuquenornis 
Halimornis 
Enantiophoenix 
Shengiornis 
Concornis 


Longirostravis 


Archaeorhynchus 
Patagopteryx 
rp Longicrusavis 
[— Gansus_yumenensis 
[—— Yanornis 


Apsaravis Eocathayornis 
Ichthyornis = a Eoalulavis 
Hesperornis Cathyaornis _ 
Gallus 
Anas 


Extended Data Fig. 10 | Results of analysis using a priori weights and the were the same for k values of 12-20 and of 25; k values between 2 and 11 
unsimplified cladogram depicting the results of the phylogenetic analysis. differed only from higher k values in the relative placement of some derived 
For further discussion, see Supplementary Information. a, A priori weighting enantiornithines. Here Nanantius valifanovi is considered ajunior synonym of 
resultsina polytomy consisting of all taxa more derived than Archaeopteryx. Gobipteryx minuta (thus the operational taxonomic unit name is givenas 

b, Results of the analysis using implied weighting with ak value of 16; results ‘Gobipteryx_N_valifanovi’). 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection No software was used to collect data, except for the linear dimensions of the skull of Mellisuga minima: Lars Schmitz took measurements 
of a scaled digital model in Meshlab 2016.12. 


Data analysis All data were analysed in R 3.5.0 using routine phylogenetic comparative methods. Custom scripts are available in the Supplementary 
Information. Figures were made using Adobe CS 2018. Phylogenetic analysis was run using TNT 1.0. CT data was processed using ImageJ 
1.4 and VG StudioMax 1.2, 2.1 and 2.4. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


All data are available in the Supplementary Information. 
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Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 
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Study description 


Research sample 


Sampling strategy 


Data collection 


We describe the anatomy of a new fossil enclosed in Burmese amber, facilitated through synchroton microtomography. We also 
performed large-scale phylogenetic comparative analysis of the scaling of eye socket size with respect to skull length, and 
visualization of eye shape through scleral ring and eye socket anatomy. Both approaches facilitate the comparison of the fossil with 


modern birds and lizar' 


The fossil we are desc 
HPG-15-3. To faciliate 


birds. Data on eye sha 
rom a previous study 


rom the 206 species) 


Samples were chosen to reflect 
all analyses were performed wi 


Li Gang and and Yi Qiru conduc 


ds. 


ribing in this paper is curated in the Hupoge Amber Museum, identified through the unique collection number 


comparative analyses, 
birds. These samples were chosen to reflect 
ength, external and internal scleral ring diameter) and diel activity patterns of lizards and birds are 


pe (orbit 
(Schmitz and Motani, 


data on eye socket and skull length were collected for a total of 206 extant species of 
all major clades of living birds, providing an overview of the overall pattern found across 


2011, Science). 


all major clades of living birds. Whenever possible, multiple individuals per species were measured; 
h the average values of species. Regression analyses of smaller sample sizes (n=50, randomly chosen 


revealed very similar results, hence the sample size used for the final analysis is deemed adequate. 


ed the CT scans and reconstruction - HPG-15-3 was imaged nondestructively using propagation phase 
contrast Synchrotron Radiation X-ray microtomography (PPC-SRX-UCT) on beamline 13W of the Shanghai Synchrotron Radiation 
Facility (SSRF). The SR beam was monochromatized at 22 keV using the double Si (111) crystal monochromator. The distance 


between sample and detector (propagation distance) was 60 mm to obtain the phase contrast. The physical pixel size of CCD sensor 
was 6.5um?6.5um and we used a 2? microscope objective; the isotropic voxel size was 3.25 um. The pixel number of our detector 


was 2048 x 2048 and its dynamic range was 16 bit. The field of the view of our detector was 6.6 mm ? 6.6 mm. 


6 LCT scans under the same conditions were performed to image the whole specimen due to the fact its size in the vertical direction 
is much larger than the FOV of our detector. The off-axis scanning 360° mode was adopted to 3 scans because the sample’s size in 
horizontal direction was larger than detector’s FOV. In this mode 6000 projections were collected for a single CT scan. The exposure 
time of single projection was 0.3 s. 
In the off-axis scanning data preprocessing, the image stitching was the first step that always required to consider the image 
alignment, the contrast unification, and the tilt of the rotation axis. We obtained 3000 stitched images with enlarged horizontal FOV, 
about 6.6mm _ 13mm, for 3000 project angle. The phase retrieval and slice reconstruction of all the six projection data sets were 
performed using PITRE-3 software [S12]. After all the reconstruction, the six segments CT slice sets were stitched again in the vertical 


direction. 


The CT slices for the entire sample consisted of 36 GB data. To reduce the amount of memory used, the sampling interval was set as 
one pixel in all 3 directions so that the data were reduced to one-eighth of its original size. The amber and all impurities surrounding 
the skull were removed using the image segmentation function. The rotation correction was performed using ImageJ software. The 
3-D data processing, segmentation, and ana 


was obtained through manual segmentation 


Jingmai O'Connor per 
published by Atterhol 


ormed the phylogene 


elucidate the phylogenetic position of HPG- 


published data. The m 


ysis was performed using VG StudioMax 1.2 and 2.1. The 3d morphology of the endocast 
using ImageJ 1.4 and rendered using VG studio Max 2.4. 


ic analysis using TNT. HPG-15-3 was scored into an modified version of the matrix 


et al. 2018 which in turn is a modified version of the 2013 matrix published by O’Connor and Zhou. In order to 


5-3 we added five new cranial characters - O'Connor scored these characters from 


odified matrix consists of 36 birds scored across 257 morphological characters with Dromaeosauridae as the 


outgroup, which was analyzed using TNT (Goloboff etal 2008). First, a heuristic search was conducted using tree-bisection 


reconnection (TBR) re 
steps; an additional ro 


aining the single shor 


est tree from every 1,000 replications. This produced 46 trees with a length of 728 


und of TBR produced 2,044 trees of the same length, producing a large polytomy in the Nelson Strict 


Consensus tree. Early avian evolution is extremely homoplastic thus we utilized implied weighting. We explored k values from one to 


20 and 25 and found t 


hat the position of Oculudentavis stabilized at k values higher than one and that enantiornithine 


interrelationships stabilized at k values higher than 11 (enantiornithine relationships are also stable from k equals 2 to 11 and differ 


from higher k values o 


nly in the relative placement of a few deeply nested taxa). In the presented analysis we conducted a heuristic 


search using TBR, retaining the single shortest tree from every 1,000 replications with a k value of 16. This produced three most 


parsimonious trees wi 


which form a small po 


h a score of 21.3. These trees differed only in the relative placement of the most derived enantiornithines, 
tomy in the strict consensus tree (Consistency Index = 0.47; Retention Index = 0.645). 


Lars Schmitz collected data of the skull length and eye socket length of birds with digital calipers, in the collections of the Los Angeles 
County Museum of Natural History (LACM), California State University Long Beach (CSULB, now integrated with the LACM 
collections), and the Smithsonian National Museum of Natural History (USNM). Collection numbers of all specimens are provided in 
the data table. A cranium of Mellisuga minima (UF 19832) was CT scanned by Edward Stanley at the Nanoscale Research Facility at 
the University of Florida, Gainesville, and uploaded to Morphosource (identifier: $21840). CT acquisition data are provided at 
Morphosource (https://www.morphosource.org/Detail/MediaDetail/Show/media_id/39924). 
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Timing and spatial scale N/A 
Data exclusions No data were excluded. 


Reproducibility During the design phase of the study, linear dimensions were taken repeatedly for the same individual skeleton to ensure 
consistency. All repeated measurements were successful. 


Randomization Grouping of birds and lizards by diel activity pattern was taken from a previously published study. 
Blinding Blinding was not possible and also not relevant for this study. All bird specimens in museum collections are properly labeled hence it 


is impossible to not know what bird taxon is measured. Given that the main goal was to characterize the overall scaling of eye socket 
size, irrespective of taxon or ecological group, blinding was not deemed necessary. 


Did the study involve field work? Yes No 
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Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Palaeontology 


Specimen provenance Angbamo locality, Hukawng Valley, Kachin Province, Myanmar; the source and preservation of HPG-15-3 are in full compliance 
with the laws of Myanmar and China, including Myanmar’s import and export regulations of jewelry and China’s laws and 
regulations on museum. All extant bird specimens that were included in this study are curated in the collections of the LACM 
(including the CSULB specimens), USNM, and the University of Florida. 


Specimen deposition HPG-15-3 is housed and displayed in the Hupoge Amber Museum (HPG), Tengchong City Amber Association, China. 


Dating methods No new dates are provided. 


Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information. 


Article 


Recycling and metabolic flexibility dictate 
life inthe lower oceanic crust 


https://doi.org/10.1038/s41586-020-2075-5 


Received: 16 August 2019 


Accepted: 10 January 2020 


Published online: 11 March 2020 


® Check for updates 


Jiangtao Li'?°, Paraskevi Mara?®, Florence Schubotz*“, Jason B. Sylvan®, Gaétan Burgaud®, 
Frieder Klein’, David Beaudoin”, Shu Ying Wee®, Henry J. B. Dick”, Sarah Lott”, Rebecca Cox’, 
Lara A. E. Meyer**, Maxence Quémener®, Donna K. Blackman® & Virginia P. Edgcomb??~ 


The lithified lower oceanic crust is one of Earth’s last biological frontiers as it is 
difficult to access. It is challenging for microbiota that live in marine subsurface 
sediments or igneous basement to obtain sufficient carbon resources and energy to 
support growth’? or to meet basal power requirements‘ during periods of resource 
scarcity. Here we show how limited and unpredictable sources of carbon and energy 
dictate survival strategies used by low-biomass microbial communities that live 10— 
750 m below the seafloor at Atlantis Bank, Indian Ocean, where Earth’s lower crust is 
exposed at the seafloor. Assays of enzyme activities, lipid biomarkers, marker genes 
and microscopy indicate heterogeneously distributed and viable biomass with 
ultralow cell densities (fewer than 2,000 cells per cm’). Expression of genes involved 
in unexpected heterotrophic processes includes those witha role in the degradation 
of polyaromatic hydrocarbons, use of polyhydroxyalkanoates as carbon-storage 
molecules and recycling of amino acids to produce compounds that can participate in 
redox reactions and energy production. Our study provides insights into how 
microorganisms in the plutonic crust are able to survive within fractures or porous 
substrates by coupling sources of energy to organic and inorganic carbon resources 
that are probably delivered through the circulation of subseafloor fluids or seawater. 


Diverse and abundant microorganisms are confirmed to be present 
in the basaltic upper crust>”, yet limited information exists about the 
distinctly different gabbroic lower crust that accounts for two-thirds 
of the volume of the crust®”. Expedition 360 of the International Ocean 
Discovery Program (IODP) drilled the Atlantis Bank oceanic core com- 
plex on the southwest Indian Ridge, Indian Ocean, where the lower 
oceanic crust is exhumed by detachment faulting to around 700 m 
below sea level, providing convenient access to an otherwise largely 
inaccessible realm (Fig. land Supplementary Table 1). 

Our approach combined cell counts, microscopy and shipboard 
quantification of ATPlevelsto assess the abundance and distribution of 
microbial communities, exoenzyme activity assays, carbon, hydrogen, 
nitrogen and sulfur (CHNS) and thin-section analyses of host rock sam- 
ples, marker gene (iTAG) analyses of prokaryotic diversity, analyses of 
cultures to assess the viability and activities of selected taxa (including 
those that might be rare and may be missed by molecular approaches), 
and examination of expressed genes and lipid biomarkers. Rigorous 
efforts were made at all stages to control for contamination, includ- 
ing from drilling fluids (Supplementary Table 2) during shipboard 
sampling, and from laboratory and molecular kits (further details are 
provided in the Methods and Supplementary Information). 

Samples along the 809-m depth of hole U1473A showed evidence 
of carbonate and/or clay-altered felsic veins (variable 0.01-6 mm 


openings) within predominantly olivine gabbro, oxide gabbro and 
gabbro (Fig. 2) with around 0.1-5% porosity and 0.2-4 wt% water 
content. Borehole logging indicates temperatures of 15-18 °C. Total 
organic nitrogen (0.002-0.003 wt%) and total organic carbon (0.004- 
0.018 wt%) contents were extremely low (Supplementary Table 1). 


Detection of biomass and biomarkers 


Low-density (131-1,660 cells per cm*), heterogeneously distributed 
microbial communities were detected (Supplementary Table 3b), 
in the same range as recent studies of young, cool and oxic ridge- 
flank subseafloor basalts in the Atlantic Ocean” and samples from 
0-15 mbelow seafloor (mbsf) obtained from the 3.5-million-year-old 
Atlantis Massif". Detection of archaeal intact polar lipids (IPLs) with 
a dominance of archaeol-based lipids over glycerol dialkyl glycerol 
tetraethers (GDGTs) indicates the existence of indigenous archaeal 
communities that are distinct from those that are observed in typical 
deep-biosphere sedimentary settings” (Fig. 3b, Extended Data Fig. 1 
and Supplementary Table 3a). Bacterial diether glycerol (DEG) lipids 
were detected at all depths, which require less cellular maintenance 
than fatty-acid-containing lipids (Fig. 3c) and may indicate an adap- 
tation tothe stresses of living in the low-energy plutonic rocks of the 
deep biosphere”. 


'State Key Laboratory of Marine Geology, Tongji University, Shanghai, China. 7Department of Geology and Geophysics, Woods Hole Oceanographic Institution, Woods Hole, MA, USA. *MARUM, 
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®Université de Brest, EA 3882, Laboratoire Universitaire de Biodiversité et Ecologie Microbienne, ESIAB, Technopéle Brest-lroise, Plouzané, France. ‘Department of Marine Chemistry and 
Geochemistry, Woods Hole Oceanographic Institution, Woods Hole, MA, USA. ®Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA. These authors 
contributed equally: Jiangtao Li, Paraskevi Mara, Virginia P. Edgcomb. “e-mail: vedgcomb@whoi.edu 
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Fig. 1| Study site of the exploration of the intrusive oceanic crust byIODP 
Expedition 360. a, Map of study site. b, Schematic of lower oceanic crust 
exhumed at the seafloor at Atlantis Bank. Dashed lines indicate the detected 


The high IPL-to-core-lipid ratios support a likely in situ source of 
detected lipids. After cell death, the labile head groups of membrane 
lipids are enzymatically cleaved, leaving behind the core lipids that can 
be preserved inthe rock record over millions of years. Concentrations 
of corelipids were as lowas for their IPL counterparts, ranging between 
4and 3,700 pg g ‘of rock (Supplementary Table 3a). The resulting IPL- 
to-core-lipid ratios are high, on average more than 70% for glycosidic 
(G)-GDGTs/(G-GDGTs + core GDGTs) and 35% for glycosidic archaeols 
(G-ARs)/(G-ARs + core ARs), pointing to the minimal accumulation of 
fossil material. In comparison, IPLs typically comprise less than 10% of 
the sum of core lipids and IPLs in sedimentary settings”. 

Raman spectroscopy ofa sample from 182 mbsf shows Raman bands 
that are consistent with amino acids and sugars. Scanning electron 
microscopy shows filamentous inclusions of organic compounds 
associated with iron and manganese oxides, surrounded by, and 
inter-grown with, calcite (CaCO,), evidence of in situ origin (Fig. 3a). 
The viability of the cells is further supported by (1) the detection of 
methane production in long-term enrichments (average production 
from 9.62 x 10*-5.66 x 10 nmol CH, g7 of rock per day) (Extended 
Data Fig. 2); (2) the cultivation of more than100 unique fungal isolates 
from samples along the entire depth of the hole; (3) ATP concentrations 
above detection levels (1 pg cm’®) in 6 out of 11 rock samples (Fig. 2); 
and (4) detectable alkaline phosphatase exoenzyme activity rates of 
0.04-2.3 pmol cm™* hin all 5 samples for which measurements were 
attempted (Supplementary Table 3b). 

Marker gene (iTAG) libraries reveal low diversity (Simpson scores 
(1—D) of 0.48-0.93) of putative heterotrophic and chemoautotrophic 
taxa, many of which have been described from deep-sea and polyex- 
treme habitats (Extended Data Fig. 3). These include Nitrosopumi- 
laceae, SAR202 Chloroflexi, Nitrospinaceae, SAR4.06 Marinimicrobia, 
SAR324 Deltaproteobacteria and Thioglobaceae, among other hetero- 
geneously distributed taxa. Variation in the taxonomic composition 
among samples suggests that the availability of carbon and energy 
substrates is also heterogeneous. The SAR324 (marine group B) line- 
age of Deltaproteobacteria was detected in 8 out of 11 samples and is 
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presence of fractures along the depth of hole U1473A. The inset map is adapted 
froma previous study”; the background map was made using GeoMapApp 
(http://www.geomapapp.org/). 


known to have wide metabolic flexibility, including carbon fixation 
through Rubisco, and an ability to oxidize alkanes, methane and/or 
sulfur to generate energy”°—all potential metabolisms in the crustal 
basement. The detected DEG lipids could also be in part derived from 
these Deltaproteobacteria”. The putative sulfur and hydrogen oxidizers 
SAR406 Marinimicrobia were also detected in all samples’. 

Non-metric multidimensional scaling (Extended Data Fig. 4) indi- 
cates that the measured ATP concentrations, fracturing intensity and 
graphic lithology all had the most-significant effect on observed depth 
variance in diversity forthe 11 samples (P< 0.05). Locations of fractur- 
ing and alteration are where seawater from above, or low-temperature 
fluids from below, can most readily permeate the rock. Unfortunately, 
it was not possible to sample borehole fluids for important studies 
of in situ fluid chemistry during Expedition 360. Other studies of the 
subseafloor lithosphere show that such fluids can carry the required 
substrates to support microbial life”’. 


mRNA reveals diverse strategies 


Recovery of mRNA was lower for the shallowest samples from 11 and 
168 mbsf, consistent with observations that the top of the drill hole 
was primarily dense unaltered gabbro with few fractures to support 
microbial populations (Supplementary Table 4; further details for all 
analyses are provided inthe Supplementary Discussion). Detection of 
transcripts for various metabolic processes indicates that microorgan- 
isms have taken the prerequisite steps towards these processes. Addi- 
tional measurements of activities, for example, through stable-isotope 
probing, in addition to those performed in this study are needed to 
demonstrate the occurrence of these processes. Expression of genes 
unique to peptidoglycan synthesis was detected in5 out of 1lsamples, 
indicating that cells are engaging in cell division and/or maintenance 
of cell integrity (Supplementary Table 4). Transcripts involved in bio- 
synthesis of different amino acids were detected in10 out of 1lsamples 
as well as transcripts implicated in vitamin E and B12 synthesis (for 
example, cbiC and cbi/) in 9 samples (Supplementary Table 4). The cbi/ 
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Fig. 2|Summary of hole U1473A of IODP Expedition 360. Downhole 
properties of the gabbroic section determined by shipboard macroscopic 
descriptions and measured microbiological indicators. Labels indicate the 
property shown in each panel. Carb, carbonate. A key to lithology is included; 
rank and intensity range between O and 5. Faults are indicated by pale-green 
lines. Microbiological sample depths are highlighted by dashed blue lines. The 


sequence was annotated to Chroococcidiopsis sp., a cyanobacterial 
taxon that may contribute to anaerobic production of cobalamin or 
pseudocobalamin in the lower crust using a hydrogen-based lithoau- 
totrophic metabolism’°. 

Although we observed the production of methane in long-term 
enrichments (Extended Data Fig. 2), methanogens appear to be rare 
in our samples, explaining the low recovery of transcripts involved 
in methane metabolism (Supplementary Table 4), and no ribosomal 


252 | Nature | Vol579 | 12 March 2020 


1-m average) 


2-m average) 


light-grey horizontal zone around 500 mbsf indicates a zone that was drilled 
but not cored. ATP was measured once per sample. CCSF, core composite 
depth below seafloor. Replicate cell-count data are provided in Supplementary 
Table 3b. Source data are available at http://web.iodp.tamu.edu/DESCReport/ 
(filename, 360_U1473A_macroscopic.xlsx). 


RNA gene sequences of know methanogens were recovered (Extended 
Data Fig. 3). However, the abundance of SAR324 Deltaproteobacteria 
in our iTAG libraries indicates potential methanotrophy”° coupled 
with methane production in our long-term enrichments (Extended 
Data Fig. 2). Some methane metabolism is probably present, but its 
importanceis unclear in these samples. Detection of glycosidic archaeol 
and acyclic GDGT (GDGT-0) may indicate the presence of methanogens; 
however, these lipids may also derive from other archaeal sources. 
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Fig. 3 | Biological signatures at hole U1473A revealed by Raman 
spectroscopy and membrane lipid analyses. a, Raman spectral features of 
organic inclusions in sample 21R2 from 182 mbsf. Inset, back-scattered electron 
image of an organic inclusion. Scale bar, 100 pm. Spectral measurements were 
performed at twospots on this particular feature, 10 independent times per 
spot, with similar results. b, Diversity (left) and concentration (right) of 


Transcripts involved in nitrogen metabolism were detected in 
9/11 samples, including genes associated with denitrification and 
dissimilatory nitrite reduction to ammonium, the latter of which was 
found in zones where fracturing and vein intensity were greatest (274.6, 
460.4 and 619.6 mbsf). Transcripts involved in sulfur metabolism were 
detected in most samples (Supplementary Table 4). 

Autotrophy might be expected in the gabbroic basement given the 
likelihood of only ephemeral sources of organic carbon via circulat- 
ing fluids and the potential for seawater reactions with olivine-rich 
rocks that can produce molecular hydrogen (H,), methane (CH,) and 
other short-chain hydrocarbons that can be used as reducing agents 
for metabolic energy production”. Indeed, fluid inclusions present 
within gabbro fromthe Atlantis Bank representa source of abiotic CH, 
and H,, which can be released upon dissolution or fracturing of the 
mineral host”. Although the expression of genes that are potentially 
involved in 5 of the 6 known carbon-fixation pathways was observed 
(Supplementary Table 4), transcriptomes suggest that heterotrophy is 
also active, and that in situ microorganisms efficiently recycle and store 
the available organic compounds (Fig. 4). For example, the expression 
of transglycosylase genes associated with peptidoglycan degradation 
annotated to Desulfobacterales and Thermus was detected at 460.4 and 
643.9 mbsf. Furthermore, polycyclic aromatic hydrocarbons that can 
be formed through natural processes (for example, pyrolysis in hot- 
ter, deeper layers”’) may serve as carbon sources in the lower oceanic 
crust. We detected the expression of genes involved in polyaromatic 
and aromatic hydrocarbon degradation in 10 out of 11 samples; these 


archaeal IPLs within the 11independent samples. c, Diversity (left) and 
concentration (right) of bacterial DEGs within the 11 independent samples. The 
summed carbon chain lengths from C28 to C39 are shown. 1G, monoglycosidic; 
2G, diglycosidic; AR, archaeol. Lipid data are from single measurements owing 
tosample constraints. Source Data are available online. 


genes were primarily annotated to Pseudomonas (Supplementary 
Table 4). Recycled proteins are another important source of carbon. 
Expression of oligo-, DD-endo-, amino- and carboxypeptidases was 
detected (Supplementary Table 4). Expression of anabolic and cat- 
abolic genes implicated in the synthesis and degradation of amino 
acids known to be involved in Stickland reactions were detected in 
10 of the 11 samples, accounting for up to 24% of normalized reads, 
indicating that these reactions may havea role in the nutrient-poor 
lower ocean crust (Fig. 4). Stickland reactions involve the oxidative 
deamination of one aminoacid witha higher oxidation state, and the 
reductive deamination of another amino acid with a lower oxidation 
state, leading to ATP synthesis by substrate-level phosphorylation”. 
Heterotrophic processes such as the degradation of amino acids may be 
the dominant terminal steps of organic carbon mineralization to CO, in 
these samples; however, the incomplete coverage of expressed genes 
in the recovered metatranscriptomes means that we cannot exclude 
contributions from additional pathways, including sulfate reduction, 
that may be expressed at low levels. 

Many biochemical pathways produce acetyl-coenzyme A (acetyl- 
CoA), propionyl-CoA or acetoacetyl-CoA as intermediates that may 
contribute to the production of polyhydroxyalkanoates (PHAs)—bacte- 
rial and archaeal biopolymers knownto serve as compatible solutes in 
halophiles”*. PHAs also serve as carbon-storage molecules, reducing 
equivalents and energy sources under carbon, nutrient and oxygen 
limitation*’*. The primary role of PHAsin our samples is likely to be as 
energy storage in an environmentin which the abundance of nutrients 
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Fig. 4| Schematic representation of metabolic processes inferred from 
observed transcripts in core samples from IODP Expedition 360. As the 
expression of all of the genes in the presented pathways was not detectedin 
every sample, the schematic is presented as a working hypothesis only. PAH, 
polycyclic aromatic hydrocarbons; UDP-MurNAc, uridine diphosphate N- 
acetylmuramic acid; UDP-GIcNAc, uridine diphosphate N-acetylglucosamine. 
Amino acids are highlighted in bold. Blue boxes highlight PHA storage 
molecules, the production of which is supported by the detected transcripts. 


can vary considerably over time. PHAs such as poly-3-hydroxybutyrate 
esters can be synthesized throughan operon of three genes (phaCAB). 
Expression of phaB or phaC was detected in samples from 10.7 and 
619.6 mbsf and both were detected in the sample from 714.9 mbsf; 
these genes were primarily annotated to Pseudomonas. Propionyl- 
CoA can be used for the biosynthesis of poly(3-hydroxybutyrate-co- 
3-hydroxyvalerate), a type of PHA that is produced by bacteria and 
archaea”’. Formation of poly(3-hydroxybutyrate-co-3-hydroxyvaler- 
ate) requires the action of acetyl-CoA carboxylase and propionyl-CoA 
carboxylase. Expression of both genes was detected in the sample 
from 619.6 mbsf (Supplementary Table 4). B-Oxidation of fatty acids 
through3-ketoacyl-CoA thiolase, 3-hydroxyacyl-CoA dehydrogenase 
and acyl-CoA dehydrogenase can produce propionyl-CoA, as can the 
catabolism of amino acids using 2-oxoisovalerate dehydrogenase and 
methylmalonyl-CoA mutase. One or more of these genes was detected 
inallsamples, including those obtained from the two deepest samples 
(Fig. 4 and Supplementary Table 4). Collectively, our data suggest that 
PHAs have an important role in survival in the deep biosphere. 
Inorganic phosphorus starvation is hypothesized to occur in the 
basaltic basement” and is a possible condition in the lower oceanic 
crust. The levels of inorganic phosphorus are efficiently regulated in 
bacteria by atwo-component system” that involves ABC transporters. 
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Green boxes highlight biosynthetic processes, degradation pathways and two- 
componentsystems supported by transcripts. Yellow boxes highlight cellular 
activities supported by transcripts. Dashed red arrows represent expected 
(but unobserved) connections based on the data. The KREBS cycle (also known 
as the tricarboxylic acid cycle) generates energy via the oxidation of acetyl- 
CoA. B-Oxidation is the process of degradation of fatty acids to generate 
acetyl-CoA. The two processes have been circled. 


Expression of phosphate transporters was detected in samples from 
274.6,460.4, 558.5, 619.6 and 747.7 mbsf. Alkaline phosphatase is part of 
the phoregulon that encodes extracellular enzymes capable of obtain- 
ing inorganic phosphorus. Genes of the pho regulon were expressed in 
samples from 619.6 and 643.9 mbsf, consistent with the detected alka- 
line phosphatase activity (Supplementary Table 3b). We hypothesize 
that recycled organic carbon may be used as a source of phosphorus 
for cells in this environment (Supplementary Tables 3, 4). 

Our transcriptome data of life inthe lower oceanic crust reveal het- 
erotrophic activities that reflect the competition for limited and spo- 
radically available resources, adaptations for withstanding long periods 
of resource scarcity, and efficient recycling of pools of organic matter 
in this challenging environment (further discussion of the results is 
provided in the Supplementary Discussion). 


Conclusion 

Circulation of fluids through fault zones and fractures in the lower 
oceanic crust can facilitate delivery of volatiles (for example, H,, 
CO, and CH,), nutrients, and abiotic and biotic electron donors and 
acceptors. Fluid pathways may represent advantageous habitats for 
microorganisms, yet the distribution of life in the deep crust remains 


poorly understood. We reveal alow-biomass, but viable and/or active 
community in the exhumed lower oceanic crust, expanding our view 
of the extent of Earth’s biosphere. Cellular activities appear to be very 
low based on enzyme activity measurements and mRNA recovery. 
Unexpectedly, heterotrophic processes may dominate over more- 
familiar autotrophic processes found at seafloor hydrothermal vents 
and in shallow marine sediments. Microorganisms can adapt to life in 
this ‘slow lane’ at least in part by using available fermentable organic 
molecules*°. Given the global expanse of the lower oceanic crust within 
known temperature limits for life, even low-biomass and slow-growing 
communities may make non-trivial contributions to global nutrient 
cycling. Future exploration of deeper lower ocean crust that is not 
exposed to faulting is required to determine whether the diversity and 
activities of microbiota present at those locations are similar to those 
found below the Atlantis Bank. 
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Methods 


Data reporting 

No power analyses were done to pre-estimate sample size. The sample 
size was determined by sample availability. The molecular control 
samples and actual samples were identified by different numbers and 
were treated blindly until the final stages during which these labels were 
replaced with the actual depth-associated identifiers. 


Study site 

The Atlantis Bank was uplifted from beneath sheeted dikes and pillow 
lavas inthe local rift valley” at the palaeo ridge-transform intersection. 
On the crest of the bank, there is a 25-km? wave-cut platform that is 
the location of an 809-m hole (U1473A) drilled during IODP Expedi- 
tion 360 (32° 42.3402’ S, 57° 16.6910’ E) along a plate-spreading flow 
line at the centre of the gabbro massif. The eroded platform exposes 
massive foliated gabbro and oxide gabbro mylonites with enclaves 
of un-deformed olivine gabbro™. Oolitic sands and deep subaerial 
weathering occur on the flanks of the Bank down to 2,500 m depth, 
indicating the massif once rose to as much as 1.8-km above sea level, 
before subsiding to its present 700 m depth”, where it lies more than 
3 km above the surrounding seafloor of the same age. Flexural uplift at 
the inside corner high can only account for around 1 km of this uplift, 
the rest is associated with later normal faulting accompanying isostatic 
uplift that occurred around 12 million years ago™®**. A notable feature 
of the gabbro massif is that the boundary between crust and mantle 
rock is exposed along the transform valley wall for some 30 km. This 
structural geometry suggests that the overlying gabbro massif is in 
direct contact with partially serpentinized peridotite at a depth of 
around 4,500 m below site U1473A*. 


Sampling 

Hole U1473A was drilled with a rotary coring bit, recovering cored rock 
in 10-m increments within core barrels that were brought on deck of 
the drill ship JO/DES Resolution and laid out on trays before splitting 
the cored rock samples into approximately 10-20-cm-long whole round 
core sections that were immediately selected and removed from the 
coreliner for microbiological analysis by scientists who wore gloves and 
masks; each rock sample was placed into asterile plastic bag for transit 
to the microbiology laboratory. Prioritized core samples exhibited 
evidence of low-temperature alterations by aqueous fluids, including 
carbonate veins and clay-altered felsic veins. We collected 68 samples 
for microbiological analysis; 11 of these—spanning the depth of hole 
U1473A—were selected for analyses from among those that showed 
the highest concentration of altered felsic and/or carbonate veins. 

In the microbiology laboratory on-board the ship, each core sec- 
tion was rinsed three times with sterile distilled and deionized water, 
changing the bag each time, and then sprayed with 200-proof ethanol. 
Atthis point, after around 5 min, the core section was transferredtoa 
custom laminar flow hood equipped with a HEPA filter and air supply 
that maintained positive air pressure. Within the hood, samples were 
photographed and the exterior approximately 1-2 cm of each whole 
round core was removed using a sterile chisel and rock hammer withina 
sterilized custom-made 0.3-m x 0.2-m x 0.1-m stainless-steel rock box. 
After removing core exteriors, core interiors were divided for different 
analyses. Three times during the cruise (during the first, second and 
third week of drilling), 1-litre samples of drilling fluid were filtered 
onto 45-mm 0.2-um-pore-size Millipore Express Plus polycarbonate 
filters and frozen at—80 °C for DNA and RNA analysis. Atthesametime, 
50-ml samples of drilling mud (Sepiolite) were also collected and fro- 
zen. Materials for DNA, RNA and lipid analyses were stored in sterile 
50-ml Falcon tubes, whereas microscopy and thin-section samples were 
placed inside sterilized aluminium foil. These samples were carefully 
labelled, placed within plastic bags and immediately frozen at -80 °C. 
All equipment was rinsed and flame-sterilized between samples. 


Contamination controls 

Rotary coring contaminates the exteriors of core samples due to the 
circulation of drilling fluids (a mixture of Sepiolite and surface seawater) 
around the drilling bits. Extreme care was taken to remove or minimize 
this contamination and to not introduce new contaminations during 
sample handling and analysis. During IODP Expedition 360, anewless- 
volatile tracer, perfluoromethyldecalin, was successfully tested and 
calibrated and thus used to quantify the intrusion of drilling fluids into 
the interior of samples. Asa further control for laboratory contamina- 
tion, open Petri dishes containing microbiological medium used to 
culture fungi were placed inside the laminar flow hood during sample 
processing and these plates then were stored at room temperature for 
the duration of the cruise. In addition, extraction blanks, procedural 
controls and samples (of drilling fluids) were analysed for lipid and 
nucleic acid analyses (see the ‘Lipid extraction and UHPLC-MS’, ‘DNA 
extraction and small subunit ribosomal-RNA marker-gene analysis’ and 
‘RNA extraction and metatranscriptome analysis’ sections). 


Cell counts 

Paraformaldehyde (4 ml; 4% solution in100 mM phosphate-buffered 
saline) was added to autoclaved 7-ml plastic tubes. Then, 1 ml of pow- 
dered rock material was added to each tube, bringing the volume to 
5 ml total. Two replicate tubes were prepared for cell counts for each 
sample and—where possible—vein material alone was aliquoted into one 
sample and whole-rock powder was aliquoted into the other. Preserved 
samples were stored at 4 °C for onshore analysis. For cell counts, 1 ml 
of the fixed sample slurry was used in the quantification procedure*, 
which was modified to use 40 cycles of sonication instead of 20 to 
better release cells attached to the powdered rock. Cells were enumer- 
ated on filters stained with a 1:40 dilution of the stock SYBR Green| in 
TE buffer by counting either around 400 fields of view if fewer than a 
total of 40 cells were detected, or at least 40-50 cells in fewer fields 
when possible. The limit of quantification was defined as 3x the s.d. 
of the mean of the negative-control counts. One negative control was 
processed andanalysed for every 11 experimental samples. Allsamples 
were counted in duplicate. 


ATP and exoenzyme assays 

ATP concentration was assessed for all samples on board the JO/DES 
Resolution using luminescence methods* with the ATP Biolumines- 
cence Assay Kit (Sigma-Aldrich) anda Turner Designs BioSystems 20/20 
luminometer (Promega). The presence of ATP is indicative of microbial 
biomass. A standard curve of 0 (sterile MilliQ water), 1 and 100 ngI* 
ATP standard was run with each analysed set of samples. ATP in core 
samples was measured by placing 500 ul of ATP Assay Mix into aclean, 
sterile microcentrifuge tube after which the tube was incubated at 
room temperature for 3 min to allow hydrolysis of endogenous ATP, 
thus decreasing the background signal. Then around 0.5 cm? of pow- 
dered sample was added (all tubes were weighed) to the same tube and 
the solution was immediately transferred to a clean 1.9-ml screw-top 
glass vial (acid washed) with the cap off, using a 1-ml pipette tip with 
the tip cut off. The vial was placed into the luminometer and results 
were read immediately. From the beginning of the expedition until 
drilling reached 218 mbsf, samples for ATP assays used leftover pow- 
dered rockin the steel rock box after separation of sample material for 
other assays. After observing largely undetectable ATP concentration, 
asecond approach was used because microbial cells—if present—are 
likely to be more concentrated along cracks, within veins and vugs. 
Thus, starting at 218 mbsf, material from these specific features were 
included for these analyses. 

Alkaline phosphatase activity was measured using fluorogenic 
substrate 4-methylumbelliferyl phosphate (Sigma-Aldrich) and its 
reference standard, methylumbelliferone. Fluorescence was meas- 
ured using black, flat-bottom, 96-well microplates in a Spark 1OM 


Multimode Microplate Reader (Tecan). Fluorescence of methylumbel- 
liferone is greatest at pH 10, therefore 25 pl of 0.4 M NaOH was added 
to the wells (final concentration 40 mM) to be read. Then, 25 pl of 1M 
EDTA was added tothe wells (100 mM final concentration) to prevent 
precipitation of carbonate from sampled veins”. Fluorescence was 
measured with an excitation wavelength of 380 nm and emission of 
454 nm for all substrates and standards. We mixed 1 cm? powdered 
rock with 5 cm? of sterile artificial seawater (ASW) ina 8-ml serum vial 
with 90:5:5 N,:CO,:H, headspace. Then, 700 ul of each slurry was trans- 
ferred with a sterile syringe into a 1.5-ml Eppendorf tube after set-up 
but before sealing the vial; this sample served as Ty, which comprised 
triplicate 200 pl technical replicates. The 700-p1l samples were briefly 
centrifuged (60s at 2,500 rpm) and the supernatant was used for the 7, 
analyses. Two additional samples were taken using the same methods as 
for T, after at least 2 weeks and then again after 4-6 weeks to generate 
aslope of activity. Incubations were kept at 10 °C, the inferred in situ 
temperature, for the duration of each assay. 

Autoclaved, powdered rock from each of the samples was tested 
to determine the amount of fluorophore adsorbance to rock powder. 
Adsorbance was found to behave in a systematic way, resulting ina 
straight line when comparing fluorescence standards in ASW alone 
with fluorescence standards plus rock powder in ASW, although this 
relationship was found to be different when measured after 4h versus 
days later. Therefore, a correction factor for adsorbance was applied 
to the enzyme data for the initial measurement (79, y =1.90x — 676), 
which was taken less than 2 h after experiment initiation, versus the 
second and third measurements (7, and T,, y= 4.64x — 303), which were 
taken days to weeks later. Negative controls consisting of the same ASW 
used for the sample incubations plus substrate, but no sample, were 
consistently below the limit of detection. The limit of quantification 
for the alkaline phosphatase assay, defined as 3x the s.d. of the blank, 
was 0.0242 pmol cm® of rock h based on the analysis of 8 blanks. 


Carbon and nitrogen analysis 

Powdered rock material from each sample (produced in the laminar 
flow hood on the /O/DES Resolution using a sterile mortar and pestle) 
was immediately transferred to sterile, muffled glass containers and 
stored ina desiccator until analysis of carbon and nitrogen according to 
established methods**””. In brief, samples were weighed into methanol- 
rinsed silver boats (4mm x 6 mm, Costech). Then, 96-well glass plates 
(combusted for 4 h at 450 °C) holding these samples were placed ina 
vacuum desiccator that also contained an open dish with about 50 ml 
fresh, concentrated (12 N) HCI. An inverted crystallization dish was 
placed over the samples to protect them from water condensation. The 
desiccator was closed and pumped out with an air-driven aspirator, toa 
reading of about 0.5 atm and the desiccator was placedin an oven kept 
between 60 and65 °C. Acidification was allowed to run for 60 to 72h, as 
described previously*®. When acidification was complete, the samples 
were removed and set in the ovento dry (60-65 °C). Subsequently, the 
samples were placed in a vacuum desiccator charged with indicating 
silica gel (Fisher S162-500, activated by heating above 220 °C for sev- 
eral hours) and pumped down again and dried for about 24h. Samples 
were then analysed ona Costech 4010 Elemental Analyzer connected 
via a Finningan-MAT Conflo-Il interface to a DeltaPlus isotope ratio 
mass spectrometer. 


Thin-section preparation, scanning electron microscopy and 
Raman analyses 

Thin-section billets were cut from dedicated subsamples of the 11sam- 
ples examined inthis study. Thin sections were prepared by High Mesa 
Petrographics. Mosaic images were taken of all thin sections in transmit- 
ted and reflected light and were used to guide the scanning electron 
microscopy (SEM) and Raman analyses. Uncoated thin sections were 
screened with SEM in low-vacuum mode to search for and visualize 
carbon-richinclusions of possible organic origin. SEM was performed 


ona Hitachi TM3000 scanning electron microscope equipped witha 
Bruker energy-dispersive spectroscopy system for imaging and semi- 
quantitative element analysis. Promising samples with possible organic 
inclusions were analysed using a computer-controlled, high-resolution 
confocal Raman system (Horiba LabRam HR) equipped with three 
lasers (633 nm, 532 nm and 473 nm), a motorized stage and a SWIFT 
fast-mapping option. Confocal Raman spectroscopy enables the non- 
destructive analysis and recognition of living and fossil (once-living) 
microorganisms in altered igneous rocks. The achievable lateral and 
spectral resolution of this instrument is better than 1 1m and 2cm", 
respectively. Spectra were analysed using the BioRad Knowitall soft- 
ware and spectral databases to identify organic compounds. 


Lipid extraction and UHPLC-MS lipid biomarker analysis 
Crushed core samples stored in Falcon tubes at -80 °C were first milled 
for 10 mintoa fine powder and subsequently extracted witha modified 
Bligh and Dyer method according to a previous study“. Before mill- 
ing and extraction of eachsample, a procedure blank was performed. 
First, a milling blank was performed using combusted sea sand (fired 
at 450 °C for 5 h) toclean the mill and to limit cross-contamination of 
samples. Subsequently, this sea sand was transferred to geo-cleaned 
(rinsed three times with a mixture of methanol and dichloromethane) 
Teflon containers used for the extraction of the samples and solvent- 
extracted inthe same manner as the samples. For this, 100 ng of an inter- 
nal standard (C46 glycerol trialkyl glycerol tetraether) and around 50 ml 
of asolvent mixture of dichloromethane:methanol:buffer (2:1:0.8, v/v) 
was added to the sample in the Teflon container and ultrasonicated 
for 10 min using a geo-cleaned ultrasonic stick. After ultrasonication, 
the samples were centrifuged (1,750 rpm at 10 min) and the superna- 
tant was transferred to a fired separatory funnel. The samples were 
extracted in four steps, for the first two steps a phosphate buffer 
(K,HPO,, 50 mM at pH 7.4) was used, in the third step the phosphate 
buffer was replaced by 5% trichloroacetic acid (50 g I at pH 2) and in 
the last step only dichloromethane:methanol (9:1, v/v) was used. Equal 
amounts of dichloromethane and deionized MilliQ water were added to 
the extract collected in the separatory funnel, the mixture was shaken, 
and the organic phase was collected as the total lipid extract and blown 
to dryness under a gentle stream of nitrogen. 

Analiquot of the total lipid extract was analysed using ultrahigh-pres- 
sure liquid chromatography (UHPLC) coupled to mass spectrometry 
(MS) ona Dionex Ultimate 3000RS UHPLC connected to an ABSciEX 
QTRAP4500 Triple Quadrupole/lon Trap MS (UHPLC-triple quad-MS) 
using a Turbolon electrospray ion (ESI) source. Separation of com- 
pounds was achieved on a Waters Acquity BEH C18 column (1.7 pm, 
2.1mm x 150 mm) equipped witha guard column of the same material 
following a previously published protocol”. Compounds of interest 
were screened with multiple-reaction monitoring and selected-ion 
monitoring techniques as described previously. Concentrations of 
lipids were determined relative to the internal C46 glycerol trialkyl glyc- 
erol tetraether standard and were corrected for individual response fac- 
tors using commercially available standards (diC16-DEG, archaeol) and 
isolated standards from cultures (GDGT-O, 1G-AR, 2G-AR, 1G-GDGT-O 
and 2G-GDGT-O). Sciex Analyst 1.6.3 and Sciex MultiQuant 3.0.3 (AB 
Sciex) were used for triple-quadrupole MS data acquisition and data 
processing. 

The presence of crenarchaeol was confirmed by coreGDGT analysis 
according toa previously published study*. In brief, an aliquot of the 
total lipid extract was analysed using a Dionex Ultimate 3000RS UHPLC 
connected toa Bruker maxis ultrahigh-resolution quadrupole time-of- 
flight MS, equipped with an atmospheric pressure chemical ionization 
(APCI) II source. Compounds were separated using two aquity BEH 
HILIC amide columns (1.7 pm, 2.1mm x 300 mm) intandem maintained 
at 50 °C, and n-hexane as eluent A and n-hexane:isopropanol (90:10, 
v/v) as eluent B (a detailed protocol has been published previously’). 
Drilling mud and extraction blank contamination controls were also 
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run for lipid biomarker analyses (Extended Data Figs. 5, 6). Bruker Com- 
pass 1.9 and Bruker data analysis v.4.4 (Bruker Daltonics) were used for 
quadrupole time-of-flight data acquisition and processing. 


DNA extraction and small subunit ribosomal-RNA marker-gene 
analysis 

Rock material was crushed while still frozen in a Progressive Exploration 
Jaw Crusher (Model 150), the surfaces of which were sterilized with 70% 
ethanol and RNase AWAY (Thermo Fisher Scientific) inside a laminar 
flow hood. Powdered rock material was returned tothe -80 °C freezer 
until extraction. DNA was extracted from 20, 30 or 40 g of powdered 
rock material, depending on the quantity of rock available. A DNeasy 
PowerMax Soil Kit (Qiagen) was used according to the manufacturer’s 
protocol modified to included three freeze-thaw treatments before 
the addition of Soil Kit solution C1. Each treatment consisted of 1 min 
in liquid nitrogen followed by 5 min at 65 °C. DNA extracts were con- 
centrated by isopropanol precipitation overnight at 4 °C. The lowbio- 
mass in our samples required whole-genome amplification before 
PCR amplification of marker genes. Genomic DNA was amplified by 
multiple displacement amplification using the REPLI-g Single Cell Kit 
(Qiagen) as described. Multiple displacement amplification bias was 
minimized by splitting each whole-genome amplification sample into 
triplicate 16 pl reactions after 1h of amplification and then resuming 
amplification for the manufacturer-specified 7 h (8 h total). DNA was 
also recovered from samples of drilling mud and drilling fluid (surface 
water collected during the coring process) for negative controls, as well 
as two ‘kit control’ samples, in which no sample was added, to account 
for any contaminants originating from either the DNeasy PowerMax 
Soil Kit or the REPLI-g Single Cell Kit. Bacterial small subunit ribosomal 
RNA (rRNA) gene fragments were PCR amplified from multiple displace- 
ment amplification samples and sequenced at the Georgia Genomics 
and Bioinformatics Core (University of Georgia). The primers used 
were: Bac515-Y and Bac926R“. Dual-indexed libraries were prepared 
with (HT) iTruS (Kappa Biosystems) chemistry and sequencing was 
performed on an Illumina MiSeq 2x 300-bp system with all samples 
combined equally ona single flow cell. 

Raw sequence reads were processed using Trim Galore (http://www. 
bioinformatics.babraham.ac.uk/projects/trim_galore/), FLASH (http:// 
ccb.jhu.edu/software/FLASH/) and FASTX Toolkit (http://hannonlab. 
cshl.edu/fastx_toolkit/) for trimming and removal of low-quality and/ 
or short reads. Quality filtering included requiring aminimum average 
quality of 25 and rejection of paired reads that were less than 250 nucle- 
otides. Operational taxonomic unit (OTU) clusters were constructed 
at 99% similarity with the script pick_otus.py within the Quantitative 
Insights Into Microbial Ecology (QIIME) v.1.9.1 software® and uclust. 
Any OTU that matched an OTU in one of our control samples (drilling 
fluids, drilling mud, extraction and whole-genome amplification con- 
trols) was removed (using filter_otus_from_otu_table.py) along with any 
sequences of land plants and human pathogens that may have survived 
the control filtering due to clustering at 99% (filter_taxa_from_otu_table. 
py). As an additional quality-control measure, genera that are com- 
monly identified as PCR contaminants were removed**“”, 

Unclassified OTUs were queried using BLAST against the GenBank 
nr database and further information about these OTUs is providedin 
the Supplementary Discussion. OTUs that could not be assigned to 
Bacteria or Archaea were removed from further analysis. For down- 
stream analyses, any OTUs not representing more than 0.01% of the 
relative abundance of sequences overall were removed as those are 
unlikely to contribute considerably to in situ communities. The OTU 
data table was transformed to a presence or absence table and the 
Jaccard method was used to generate a distance matrix using the dist. 
binary( function inthe R package ade4. A hierarchical clustering den- 
drogram was created using hclust() and the stability of the clusters 
was evaluated using the clusterboot() function in the fpc package in 
Rwith 500 iterations. 


RNA extraction and metatranscriptome analysis 

Frozen rock material was crushed as above and then ground quickly 
into afine powder using a precooled sterilized mortar and pestle, after 
whichRNA extraction started immediately. The jaw crusher was cleaned 
and rinsed with 70% ethanol and RNaseZap RNase Decontamination 
Solution (Invitrogen) between samples. About 40 g of material was 
extracted for each sample using the RNeasy PowerSoil Total RNA Iso- 
lation Kit (Qiagen) according to the manufacturer’s protocol with the 
following modifications. Each sample was evenly divided into 8 bead 
tubes (Qiagen) and then 2.5 ml of Bead solution was added into the bead 
tube followed by 0.25 ml of solution SR1 and 0.8 ml of solution SR2. 
Bead tubes were frozen in liquid nitrogen and then thawed at 65 °Cin 
a water bath three times. RNA was purified using the MEGAclear Tran- 
scription Clean-up Kit (Ambion) and concentrated with an overnight 
isopropanol precipitation at 4 °C. Trace amounts of contaminating DNA 
were removed from the RNA extracts using TURBO DNA-free (Invitro- 
gen, USA) as described by the manufacturer. To ensure that DNA was 
removed thoroughly, each RNA extract was treated twice with TURBO 
DNase (Invitrogen). A nested PCR reaction (2x 35 cycles) using bacterial 
primers* was used to confirm the absence of DNA inour RNA solutions. 

RNA was converted to cDNA using the Ovation RNA-Seq System V2 
kit (NUGEN) according to the manufacturer’s protocol to preferentially 
prime non-rRNA sequences. The cDNA was purified with the MinElute 
Reaction Cleanup Kit (Qiagen) and eluted into 20 ul elution buffer. 
Extracts were quantified using a Qubit Fluorometer (Life Technolo- 
gies) and cDNAs were stored at —80 °C until sequencing using 150-bp 
paired-end reads on an Illumina NextSeq 550. 

To control for potential contaminants introduced during drilling, 
sample handling and laboratory kit reagents, we sequenced a number 
of control samples as described above. This included two samples 
that controlled for potential nucleic acid contamination, a ‘method’ 
control to monitor possible contamination from our laboratory 
extractions, which included around 40 g sterilized glass beads pro- 
cessed through the entire protocol in place of rock, and a‘kit’ control 
to account for any signal coming from trace contaminants in kit rea- 
gents, which received no addition. In addition, three field controls were 
extracted: a sample of the drilling mud (Sepiolite), and two drilling 
seawater samples collected during the first and third weeks of drilling. 
cDNA obtained from these controls was sequenced together with the 
rock samples and co-assembled. 

Trimmomatic (v.0.32)” was used to trim adaptor sequences (lead- 
ing = 20, trailing = 20, sliding window= 04:24, minlen=50). Paired reads 
were further quality checked and trimmed using FastQC (v.0.11.7) and 
FASTX toolkit (v.0.014). Downstream analyses used paired reads. After 
co-assembling reads with Trinity (v.2.4.0) from all controls (minimum 
length, 150 bp), Bowtie2 (v.2.3.4.1)°° was used (with the parameter ‘un- 
conc’) to align all sample reads to this co-assembly. Reads that mapped 
to our control co-assembly allowing 1 mismatch were removed from 
further analysis (23.5-68.5% of sequences remained in sample data- 
sets; Supplementary Table 4). Trinity (v.2.4.0) was used for de novo 
assembly of the remaining reads in sample datasets (minimum length, 
150 bp). Bowtie aligner was used to align reads to assembled contigs, 
RSEM* was used to estimate the expression level of these reads, and 
TMM was used to perform cross-sample normalization and to gener- 
ate a TMM-normalized expression matrix. Within the Trinotate suite, 
TransDecoder (v.3.0.1) was used to identify coding regions within con- 
tigs and functional and taxonomic annotation was made by BLASTx 
and BLASTp against UniProt, Swissprot (release 2018 02) and RefSeq 
non-redundant (nr) protein sequence databases (e-value threshold 
of 1x 10°). BLASTp was used to look for sequence homologies with 
the same e values. HMMER (v.3.1b2) was used to identify conserved 
domains by searching against the Pfam (v 31.0) database™. SignalP 
(v.4.1)°? and TMHMM (2.0c)* were used to predict signal peptides and 
transmembrane domains. RNAMMER (v.1.2)° was used to identify 


rRNA homologies of archaea, bacteria and eukaryotes. Because the 
Swissprot database does not have extensive representation of protein 
sequences from environmental samples, particularly deep-sea and 
deep-biosphere samples, annotations of contigs used for analyses 
of selected processes that we report were manually cross checked by 
BLASTx against the GenBank nr database. 

Aside from removing any reads that mapped well to our control 
co-assembly (1 mismatch), as an extra precaution, any sequence that 
exhibited >95% sequence identity over >80% of the sequence length 
to suspected contaminants (for example, human pathogens, plants or 
taxaknowntobecommon molecular kit reagent contaminants, and not 
described from the marine environment) as described previously**”® 
were removed. This conservative approach potentially removed envi- 
ronmentally relevant data that were annotated to suspected contami- 
nants due to poor taxonomic representation from environmental taxa 
in public databases; however, it affords the highest possible confidence 
about any transcripts discussed. Additional functional annotations 
of contigs were obtained by BLAST against the KEGG, COG, SEED and 
MetaCyc databases using MetaPathways (v.2.0) to gain insights into 
particular cellular processes, and to provide overviews of metabolic 
functions across samples based on comparisons of fragments per 
kilobase per million (FPKM)-normalized data. All annotations were 
integrated into a SQLite database for further analysis. 


Statistics 

Ocean-drilling legs with diverse sampling requirements can provide 
only a limited volume of rock material for microbiology from each 
depth horizon. Although still providing tremendously valuable insights 
into the deep lithified biosphere, the absence of replicate samples limits 
thetypes of statistical analyses that we can perform. A clustering analy- 
sis of curated transcripts within the functional categories presented in 
this paper was performed using log + 1-transformed FPKM values and 
the Ward method, and the distance matrix was constructed using the 
Manhattan method and pvclust in R. Non-metric multidimensional 
scaling (NMDS) analysis was performed on theJaccard distance matrix 
of prokaryotic OTU presence or absence data for 11samples but not on 
transcript data because we report here only acarefully curated subset 
of total reads for each sample. 

Taxonomic assignments of discussed contigs are presented in Sup- 
plementary Table 4, and were selected from among the top-10 BLASTx 
hits. In cases where different taxa were included within the top-10 hits 
for aconserved domain, either the top hit or the top consensus hit to 
a known marine group was selected. 


Fungal culturing 

Culturing efforts to specifically isolate fungi were performed using 
several culture conditions using not only classical culture-based 
approaches but also laser-nephelometry-based high-throughput 
culturing. Three different culture media were used (a modified Sab- 
ouraud (1g 1 mycological peptone, 3% sea salts, 15 g I" agar), a yeast 
nitrogen base (7 gl" YNB powder (Difco), 3% sea salts, 15 g I" agar), a 
minimal mediumas defined previously” and formulated with or with- 
out crushed oceanic crustal rock (10 g I), with variable carbon sources 
(galactose, celluloses or chitin at 15 g I“), witha mix of essential amino 
acids, and with or without a mix of antibiotics (chloramphenicol and 
penicillin G). Each experiment was processed at different tempera- 
tures ranging from 5 to 35 °C. After isolation, a dereplication step by 
mini- or microsatellite-primed (MSP)-PCR was processed to obtain 
unique isolates*® that were identified based on sequencing of several 
genetic markers (ITS, 18S, 28S or 26S rRNA genes, RPB2, tubulin and 
actin genes). 


Methane analysis 


Roughly 5 cm? of rock from depths of 247.71, 279.55, 621.09, 724.68 
and 747.78 mbsf were added to sterile ASW to 27 ml total in 30-ml 


serum vials. The control for the methane measurements was a vial 
of 27 ml of ASW with no rocks added. The vials were sealed with 
butyl stoppers and gassed with a mixture of 90% N,, 5% H, and 5% 
CO,, and incubated at 10 °C. Methane in the headspace of the vials 
was measured at 25 weeks and 60 weeks after incubation. The gas 
in the headspace of the vials was extracted and measured using gas 
chromatography-flame ionization detection (GC-FID), following 
protocols similar to previously described methods”. Calibrations 
were performed for the measurements using methane tanks at O ppm, 
1.0 ppm and 6.41 ppm, and the GC-FID loop was flushed for 1 min 
before each calibration measurement. The syringe used to inject the 
gas samples into the GC-FID loop was flushed three times with the 
90%N,,5% H, and 5% CO, gas mixture before each sample extraction 
and injection. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


iTAG data for the 11 samples as well as 13 different negative controls 
(drilling muds and fluids, seawater and kit controls) are deposited in 
the NCBI BioProject under accession number PRJNA497074. Results 
are presented at approximately phylum level; however, taxonomic 
assignments at finer resolution are available from the correspond- 
ing author upon request. Raw reads for transcriptome data have also 
been deposited in BioProject under accession number PRJNA497074; 
iTAGs have been deposited in the NCBI SRA under accession numbers 
SRR8136794-SRR8136814 and transcript raw reads canbe found inthe 
SRA under accession numbers SRR8141073-SRR8141077. Assemblies 
for curated portions of the data presented are available upon request 
to the corresponding author. All relevant data are available from the 
corresponding author or are included with the manuscript as Sup- 
plementary Information, and the Source Data for Fig. 2 are available at 
http://web.iodp.tamu.edu/DESCReport/ (file name, 360_U1473A_mac- 
roscopic.xlsx). Source Data for Fig. 3 and Extended Data Figs. 1, 2, 6 are 
provided with the paper. 
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Extended Data Fig. 1| Archaeal core lipid analyses. a, b, Downhole changes in respectively) and crenarchaeol are shown. c, Structures of the most abundant 
core lipid composition of archaeal IPLs (a) and archaeal core lipids (b). The archaeal lipids. 1G, monoglycosidic; 2G, diglycosidic; AR, archaeol; GDGT, 
relative abundances of the diether lipid archaeol and tetraether lipids with glycerol dialkyl glycerol tetraether. Lipid data are from single measurements 
zero, one, twoand three rings (GDGT-0, GDGT-1, GDGT-2 and GDGT-3, owing to sample constraints. Source Data are available online. 
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Extended Data Fig. 2 | Rate of methane production observed inlong-term 
enrichment experiments. The rate at 25 weeks was derived from the value 
measured at 25 weeks minus the initial value (0) divided by the time elapsed in 
days. The rate at 60 weeks was derived from the difference between methane 


measured at 60 weeks and methane measuredat 25 weeks divided by the time 
elapsed since the 25-week measurement. Dataare from single measurements 

owing to limited sample availability, except for the ASW blank (n=3), for which 
the meanis displayed. Source Data are available online. 
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Extended Data Fig. 3 | 16S rRNAiTAG composition and metabolic processes 
detected in metatranscriptomes. Top, taxonomic composition at family level 
or deeper (where possible) of 16S rRNA iTAG sequences for taxa present as 
more than 1% of the total abundance in at least one sample. Taxa present at an 
abundance of less than1% in every sample were grouped into the ‘Others’ 
category. Bottom, an overview of categories of expressed genes ineachsample 


is given belowtheiTAG composition for each sample. The presence (black bars) 
or absence (white bars) indicates the detection or non-detection of genes 
associated with the processes and activities listed inthat sample. A detailed 
discussion of metabolic pathways is provided inthe Supplementary 
Information. Annotations at higher taxonomic resolution are available upon 
request from the corresponding author. 
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Extended Data Fig. 4| See next page for caption. 


Extended Data Fig. 4 | Non-metric multidimensional scaling and clustering 
analyses of detected prokaryotic OTUs and transcripts. a, Non-metric 
multidimensional scaling analysis performed onthe Jaccard distance matrix 
(Ward clustering using hclust; see Methods) of prokaryotic OTU presence or 
absence data for 11 biologically independent samples spanning the depth of 
hole U1473A showing only environmental vectors supported by P< 0.05. 
Pvalues were generated as P=N+1/n+Laftera goodness-of-fit statistic, which 
is the squared correlation coefficient, and was calculated on 999 random 
permutations of the data using the vegan package of R. b, Clustering analysis of 
presence or absence prokaryotic OTU data for 11samples spanning the depth 
of hole U1473A. Hierarchical clustering dendrogram based onaJaccard 


distance matrix of the presence or absence data for 99% OTUs from 11 

IODP Expedition 360 samples. Jaccard similarity values of >0.8, calculated with 
the clusterboot function inR, suggest a stable cluster. Shading highlights 
several samples that share certain OTUs. From left to right (1-4): 1, samples 
share SARI1clade II and SAR406; 2, samples share Nitrosopumilaceae; 3, 
samples share SAR11 clade II; and 4, samples both contain the lowest OTU 
counts.c, Clustering analysis of curated transcripts for 11 samples within the 
functional categories presented. Clustering analysis was based on 

log +1-transformed FPKM values and the Ward method, anda distance matrix 
was constructed using the Manhattan method and the pvclust function inR. 
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Extended Data Fig. 5 | Lipid sample blanks. a, Representative UHPLC-APCI- HPLC-ESI-MS chromatograms for selected bacterial DEG lipids with summed 
MS chromatogram, showing the detection of core GDGTs insample 84R6 and chain lengths from C30 to C36 in sample 26R2 and the sample extraction blank. 
the sample extraction blank. b, Representative multiple-reaction-monitoring Lipid data are from single measurements owing to sample constraints. 
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Extended Data Fig. 6| Drilling mud contamination control during lipid 
analysis. a, Depiction of outer, middle andinner sections subsampled from 
whole round core (WRC) sample 29R3 (259.03 mbsf) in order to test the 
influence of drilling mud (DM) on the lipid biomarker composition. 

b-d, Composition of archaeal core lipids (b), IPLs (c) and bacterial DEGs (d) in 
the drilling mud, subsampled WRC sample 29R3 and selected rock samples 


analysed inthis study. e, Extracted ion chromatograms of targeted DEG lipids 
from C28 to C39 summed carbon chain lengths in the drilling mud and 
subsampled WRC sample 29R3, showing the differencesin relative abundance 
and isomer composition among these samples. Lipid data are from single 
measurements owing tosampleconstraints. 2R1, 10.7 mbsf; 31R1, 274.6 mbsf; 
68R2, 619.6 mbsf; 81R2, 714.9 mbsf. Source Data are available online. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


x | The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


x | Astatement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


x| A description of all covariates tested 


x | A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 


x AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 
x For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 
x For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
x For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 
x Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection For Raman analysis of thin section samples we used the LabSpec 6 ® Spectroscopy Suite Software (https://www.horiba.com/en_en/ 
products/detail/action/show/Product/labspec-6-spectroscopy-suite-software-1843/) for data acquisition and background subtraction of 
acquired spectra. The processed spectra were then compared with the BIO-RAD KnowltAll® Raman Spectral Library databases for 
identification of organic substances and with the RRUFF (TM) Project Raman database for identification of minerals (Lafuente et al. 
2015) (Lafuente B, Downs RT, Yang H, Stone N (2015) The power of databases: the RRUFF project. In: Highlights in Mineralogical 
Crystallography, T Armbruster and R M Danisi, eds. Berlin, Germany, W. De Gruyter, pp 1-30). For lipid analyses, Sciex Analyst 1.6.3 and 
Sciex MultiQuant 3.0.3 (AB Sciex LP, Concord, Canada) were used for triple quadrupole mass spectral data acquisition and data 
processing. Bruker Compass 1.9 and Bruker Data Analysis Version 4.4 (Bruker Daltonics, Bremen, Germany) were used for quadrupole 
ime-of flight lipid data acquisition and processing. 


Data analysis The software R was used with the pvclust package for nMDS and clustering analysis of OTU data. Trimmomatic v. 0.32 was used to trim 
adapter sequences in transcript data, FastQC v. 0.11.7 and FASTX-toolkit v. 0.014 were used to quality check and trim paired reads in 
transcriptome libraries, Trinity v. 2.4.0 was used for assembly of transcriptomes, Bowtie v. 2.3.4.1 was used to align reads to our 
assembly and for de novo assembly of reads in our libraries, Bowtie aligner was used to align reads to assembled contigs, RSEM was used 
to estimate expression levels of reads in our libraries, TMM was used to perform cross-sample normalization and to generate a TMM- 
normalized expression matrix. The Trinotate suite (TransDecoder v. 3.0.1) was used to identify coding regions, BLASTx and BLASTp against 
UniProt, Swissprot (release 2018 _02) and RefSeq nr were used to look for homologies between our reads and databases. HMMER v. 
3.1b2 was used to identify conserved domains by searching against the Pfam v. 31.0 database. SignalP v. 4.1 and TMHMM 2.0c were used 
to predict signal peptides and transmembrane domains, RNAMMER v.1.2 was used to identify rRNA homologies. BLAST against the KEGG, 
COG, SEED and MetaCyc databases using MetaPathways v. 2.0 was used to interpret functions and pathways active in our transcriptome 
data. For our iTAG data Raw sequence reads were processed through Trim Galore [http://www.bioinformatics.babraham.ac.uk/projects/ 
trim_galore/], FLASH (ccb.jhu.edu/software/FLASH/) and FASTX Toolkit [http://hannonlab.cshl.edu/fastx_toolkit/] for trimming and 
removal of low quality/short reads. Quality filtering included requiring a minimum average quality of 25 and rejection of paired reads less 
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than 250 nucleotides. Operational Taxonomic Unit (OTU) clusters were constructed at 99% similarity with the script pick_otus.py within 
the Quantitative Insights Into Microbial Ecology (QIIME) v.1.9.1 software 45and ‘uclust’. The OTU data table was transformed to a 
presence/absence table and the Jaccard method was used to generate a distance matrix using the dist.binary() function in the R package 
ade4. A hierarchical clustering dendrogram was created using hclust() and the stability of the clusters was evaluated using the 
clusterboot() function in the fpc package in R with 500 iterations. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


iTAG data for the 11 samples as well as 13 different negative controls (drilling muds and fluids, seawater, kit controls) are deposited in GenBank under BioProject 
PRJNA497074. Results are presented at ~phylum level, however taxonomic assignments at finer resolution are available from the authors upon request. Raw reads 
for transcriptome data have also been doposited in GenBank SRA under BioProject PRJNA497074. Assemblies for curated portions of the data presented are 
available upon request to the authors. 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


* | Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Samples available for microbiological studies were selected by the Co-Chief Scientists of Expedition 360. They were limited to ~10-20cm long 
whole round rock sections from recovered cores. Replication was not possible because there was a single hole drilled for all studies conducted 
during Expedition 360, and only the 10-20 cm sections of 10m cores were allocated to microbiologists. Each of the individual 10-20cm core 
sections was processed to remove exterior rock and remaining material was subdivided for all analyses presented here. 


Data exclusions Outside of subtraction of molecular data matching any contamination controls (as noted in methods), no data were excluded. All data 
collected from analyzed samples is presented. 


Replication Cell counts were replicated by processing two replicate 1ml tubes of material from each core sample. Alkaline phosphatase activity 
measurements were conducted on triplicate technical replicates of material for those analyses and results were consistent. Methane 
production measurements are single measurements due to limited sample availability, except for the artificial seawater blank (n=3), for which 
the mean value is presented. All attempts at replication were successful. Sufficient material was not available for replicating other analyses. 
Lipid biomarker measurements were single measurements due to limited sample material. Raman spectral analysis presented in Figure 3a was 
made at two spots on this particular feature of interest, 10 independent times per spot, with similar results. In total, we performed >2000 
analyses (with shorter integration times) on similar features in the same thin section sample with similar results. 


Randomization Eleven rock samples were selected from our collection for analyses presented here, purely on visual observations of the rock samples, 
prioritizing samples from the full depth of the hole drilled, and samples with evident veins and signs of alteration. 


Blinding Blinding was not applicable to this study of rock samples and associated control samples beyond the fact that all samples were number coded 
and treated identically throughout preparation and analyses. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 
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Most cortical synapses are local and excitatory. Local recurrent circuits could 
implement amplification, allowing pattern completion and other computations’ *. 
Cortical circuits contain subnetworks that consist of neurons with similar receptive 


fields and increased connectivity relative to the network average*®. Cortical neurons 
that encode different types of information are spatially intermingled and distributed 
over large brain volumes*’, and this complexity has hindered attempts to probe the 
function of these subnetworks by perturbing them individually*. Here we use 
computational modelling, optical recordings and manipulations to probe the 
function of recurrent coupling in layer 2/3 of the mouse vibrissal somatosensory 
cortex during active tactile discrimination. A neural circuit model of layer 2/3 revealed 
that recurrent excitation enhances sensory signals by amplification, but only for 
subnetworks with increased connectivity. Model networks with high amplification 
were Sensitive to damage: loss of a few members of the subnetwork degraded stimulus 
encoding. We tested this prediction by mapping neuronal selectivity’ and 


photoablating””” 


neurons with specific selectivity. Ablation of a small proportion of 


layer 2/3 neurons (10-20, less than 5% of the total) representing touch markedly 
reduced responses inthe spared touch representation, but not in other 
representations. Ablations most strongly affected neurons with stimulus responses 
that were similar to those of the ablated population, which is also consistent with 
network models. Recurrence among cortical neurons with similar selectivity 
therefore drives input-specific amplification during behaviour. 


Two circuit motifs contribute to neural dynamics in cortical layer 2/3 
(L2/3): recurrent excitation, which may cause amplification’ *"”, and 
feedback inhibition, which may account for the sparse activity typical 
of L2/3”"“*. Weexplored the role of these motifsin an integrate-and-fire 
model of L2/3 of the mouse vibrissal somatosensory (‘barrel’) cortex 
constrained by measured physiological properties" (Methods). 

To model input-specific recurrent coupling, we restricted the sensory 
input to asubnetwork of the excitatory neurons (200 out of 1,700), cor- 
responding to the number of L2/3 barrel cortex neurons that respond 
to active touch’”’ (Fig. 1a). We simulated touch-related input to L2/3 
based on recordings of their L4 inputs'* (Methods). To measure how 
accurately neural activity reflects the sensory input, we computed an 
‘encoding score’ (Retimutus) by cross-correlating the spike rate of each 
neuron with the input (Methods). 

We varied recurrence by changing the connection probability within 
the input-recipient subnetwork (subnetwork connectivity; synaptic 
conductance was scaled proportionately*) (Methods). For the connec- 
tivity of each subnetwork, we matched the distribution of the encoding 
score to that measured in vivo’ by adjusting the strength of the L4 input. 
Subnetworks with connectivity equal to and moderately increased 


relative to the rest of the network (non-subnetwork connectivity, 0.2; 
subnetwork connectivity range, 0.2-0.4) produced responses that 
were consistent with those observed in vivo’ (Fig. 1b, Methods). The 
strength of sensory input requiredto match in vivo responses declined 
with increasing subnetwork connectivity. Amplification—defined as 
theratio of network output to network input—therefore increased with 
subnetwork connectivity! (Fig. 1c). Additional increases in subnetwork 
connectivity (>0.4) produced all-or-none network responses, in which 
a transient input drove the network into a persistently active state” 
(Extended Data Fig. 1). 

Overall, subnetwork behaviour fell into three regimes, each of which 
produced a distinct response to removal (or ablation) of asmall num- 
ber of neurons. Subnetworks with low connectivity (0.2) amplified 
little, and were resistant to ablation (Fig. Ic—f). Encoding scores for 
spared neurons increased after simulated ablation of 10% of the sub- 
network", owing to reduced feedback inhibition®”° (encoding score, 
from 0.237 + 0.027 to 0.274 + 0.032 (grand median + adjusted median 
absolute deviation (MAD)); P< 0.001, Wilcoxon signed-rank test, across 
n=30 simulated networks with different randomized connectivity and 
initial conditions) (Fig. 1d-f, Methods). 
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Fig. 1| Ablation effect in simulated cortical L2/3 network depends on 
excitatory connectivity. a, The L2/3 network model comprises a subnetwork 
of 200 excitatory neurons (black) receiving sensory input (dark grey), alarger 
excitatory population (1,500 neurons; light grey) without sensory input, and 
300 inhibitory neurons (red). All populations are interconnected (connection 
probabilities listed in figure) (Methods). The probability and strength of 
connections within the excitatory subnetwork was varied (Peon, thick black 
loop) (Methods). b, Model network responses aligned to input (arrows). Left, 
subnetwork connectivity for excitatory subnetwork equal to overall 
connection probability. Right, increased connectivity. Raster plots showa 
subset of neurons from an example network. Peri-stimulus time histograms 
(PSTHs) show mean values across all neurons and networks (n=30). Bottom, 


Subnetworks with increased connectivity (for example, 0.4) exhib- 
ited stronger amplification’ (Fig. 1c). Ablations reduced encoding 
scores for spared neurons, from 0.243 + 0.049 to 0.143 + 0.028 (con- 
nectivity = 0.4; P<0.001,n=30 networks), because of reduced ampli- 
fication (Fig. 1d-f). Networks in the all-or-none regime”? were robust 
to ablation, and maintained their all-or-none response (Extended 
Data Fig. 1). The response of subnetworks to ablations therefore dis- 
tinguishes the three regimes. 

We next performed a similar analysis for actual L2/3 networks during 
behaviour. We trained mice with a single spared whisker on an object 
localization task (Fig. 2a,b, Methods) and recorded neuralactivity with 
volumetricimaging’ in L2/3 of the barrel column corresponding to the 
spared whisker (8,126 + 2,436 (mean +s.d.) neurons per mouse, n=16 
mice) (Fig. 2c, Extended Data Table 1, Methods). Activity in a subset 
of neurons encoded whisker position (hereafter whisking neurons), 
whereas others responded to touch-induced changes in whisker curva- 
ture (hereafter touch neurons)’ (Fig. 2d, e). An encoding model gener- 
ated a prediction of neural activity from vibrissal kinematics (Methods). 
Correlating the prediction with the actual neural activity yielded an 
encoding score, which was used to assign neurons to the touch and/or 
the whisking representations’ (Methods). Across 16 mice (Extended 
Data Table1), 901+ 539 of the imaged neurons encoded touch (fraction: 
0.108 + 0.051) and 865 + 364 encoded whisking (fraction: 0.106 + 0.028). 

We probed the roles of recurrence by ablating members of the touch 
representation and examining the effect on spared neurons. Several 
excitatory neurons were ablated using multiphoton excitation’”” 
(Extended Data Figs. 2,3). Ablating asmall proportion ofstrongtouch 
cells (16.8 + 12.8 neurons, 6% of touch neurons in the barrel column of 
the spared whisker, n= 9 mice; touch score, 70th + 35th percentile) 
(Fig. 2f) reduced responses to touch in the spared touch neurons 
(Fig. 2g, h). The touch-encoding score (R,ouch) across touch neurons 
declined (from 0.123 + 0.021to 0.100 + 0.037 (grand median + adjusted 
MAD); n=9 mice, 8,392 neurons, P= 0.004, Wilcoxon signed-rank test 


excitatory neurons within subnetwork; middle, excitatory neurons outside 
subnetwork; top, inhibitory neurons.c, Amplification, defined as the ratio of 
network output to sensory input (Methods), asa function of subnetwork 
connectivity, normalized to Poonn = 0.2 (mean across 30 networks per Pronn)- 

d, Predicted effects of ablation. In the equal-connectivity case (top), feedback 
inhibition dominates and responses among spared neurons increase. In the 
increased-connectivity case (bottom), recurrent excitation dominates and 
responses decline. e, Asinb, but after ablation of the 20 neurons withthe 
strongest encoding score. f, Effect of ablation on stimulus encoding, asa 
function of subnetwork connectivity. Grey points denote cross-neuron median 
for individual network. Black circles denote the grand median of 30 simulated 
networks. 


for mouse medians, paired by mouse) (Fig. 2j, Methods), as did the 
touch neuron count (from 932 + 634 to 716 +469 (mean +s.d.), calcu- 
lations exclude ablated neurons) (Methods). The whisking-encoding 
score (Ry risking) did not change (from 0.116 + 0.013 to 0.115 + 0.024; 6,975 
neurons, P=0.820; neuron count: from 775 + 267 to 721 +241) (Fig. 2i, k). 

In the model, more extensive ablations caused larger declines in 
encoding scores (Extended Data Fig. 4). In agreement with this predic- 
tion, the decline in touch representation increased as more of the touch 
representation was ablated (Pearson correlation of changein R,,,,.,and 
net R,.,, ablated, R=—-0.794, P< 0.001acrossall 24 ablations; R=-0.779, 
P=0.013 for the 9 touch cell ablations) (Fig. 21). Touchneurons proximal 
(15-35 um) to the ablated cells experienced a larger decline in R,oyc, Val- 
ues than those distal (115-135 um) from the ablated cells (Fig. 2m). The 
effects of ablating the touch neurons decayed over a distance (A= 87 pm, 
exponential fit) (Fig. 2m) similar to the spatial scale of local recurrent 
connectivity inthe rodent sensory cortex”. Whisking neurons exhibited 
no distance-dependent changes (Extended Data Fig. 5). The declining 
touch representation was not caused by changes in whisker movement 
or behaviour (Extended Data Fig. 6). This result is consistent with ampli- 
fication of touch responses by recurrent excitation in L2/3. 

By contrast, ablating a subset of strong whisking neurons (12.7 + 5.7 
neurons, approximately 4% of whisking neurons in the barrel column 
of the spared whisker, n= 7 mice; whisking score: 66th + 37th percen- 
tile) produced no effect on either the touch representation (R,,,,,, from 
0.112 + 0.081to 0.095 + 0.069, n=7 mice, 5,866 neurons; P=0.109, count 
from 838 +401to 899 + 548) or the whisking representation (RyniskingffOM 
0.108 + 0.076 to 0.107 + 0.076,n=6,161; P= 0.812, count from 880 +387 to 
974 + 558) (Fig. 2j, Extended Data Fig. 7). Similarly, ablating silent neurons 
(event rate below 0.025 Hz; 16.3 +2.6 neurons, n=8 mice) did not change 
the touch representation (R,,,,., from 0.115 + 0.021 to 0.107 + 0.023,n=8 
mice, 7,110 neurons; P= 0.383, count from 889 +713 to 926 + 681) (Fig. 2k, 
Extended Data Fig. 7, Methods) or the whisking representation (R whisking 
from 0.115 + 0.014 to 0.114 + 0.015, 7,684 neurons; P= 0.844, count from 
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Fig.2|Touchnetworks ofthe barrel cortex are sensitive to ablation. a, Mice used 
one whisker to locate a pole and reported the perceived position by licking one of 
twolick ports (red or blue). b, Task epochs (Methods). c, Imaging planes were 
centred on the barrel column of the spared whisker (magenta). Three planes were 
imaged simultaneously (same shade of grey).d, Example video frame showing the 
whisker (magenta), pole (black), whisker curvature (x, blue), and whisker angle 

(8, green). e, Neuronalactivity for a touch (cell1) and whisking (cell 2) cell. Blue 
denotes change in whisker curvature (Ax); green denotes whisker angle (6). 

f, Ablations. Pyramidal neurons1and 2 (orange) were ablated (green, GCaMPé6s; 
red, mCherry). Bottom, touch-aligned neuronal responses. g-i, Example 
experiment.g, Touch responses (dots, calcium events) (Methods) for asubset of 
neurons before (top) and after (bottom) ablation. Grey boxes denote 200 non- 
touch, non-whisking neurons; blue boxes denote 200 touchneurons. Right, touch- 


961+ 464 to 904 +384). Touch ablations produced significantly differ- 
ent changes in encoding scores among touch (but not whisking) repre- 
sentations than either whisking or silent ablations, whereas whisking 
ablations did not produce significantly different changes from silent 
ablations (Fig. 2j, k). Non-specific effects of ablation therefore do not 
contribute to changes in the touch representation after ablation. 

Ablation of whisking neurons did not degrade the whisking rep- 
resentation. Our network model suggests that the slower kinetics of 
the whisking input do not account for this lack of effect, as networks 
having increased subnetwork connectivity (0.4) with slower input 
(tpeax = 50 ms, versus 10 ms for touch simulations) were sensitive to 
simulated ablation (Extended Data Fig. 8, Methods). Touch input is 
in-phase across the neural population’, which engages recurrent 
excitation. By contrast, individual neurons encode whisking input with 
different phases”; this asynchronous population response is expected 
to engage recurrent excitation less effectively. Therefore, the lack of 
sensitivity to ablation in the whisking population does not necessarily 
indicate an absence of recurrent coupling. 
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aligned mean AF/F values across these neurons for the strongest 5% of touches. 
Shading denotes s.e.m.h, Left, map for touch cells before ablation. Sphere size 
denotes R,ouch Values. Grey dots denote other neurons. Orange denotes the 
position of ablated neuron. Centre, Ryouch Values for ablated neurons. Right, Rrouch 
values after ablation. i, As inh, but for whisking neurons. j, Median change in Ryouch 
values across neurons in individual mice after ablation. Blue, touch ablations; 
black, silent neuron ablations; green, whisking ablations. Pvalues determined by 
Wilcoxon rank-sum test.k, Asinj, but for Ryrisking Values. 1, Ablation effect on change 
in Ryouch Value as a function Of R,oych Values summed over the ablated neurons. 
Colour as inj.m, Distance-dependence (with respect to closest ablated neuron) of 
changeinR,,,,., after ablation. Proximal change in P,o,.:h:-0.132+ 0.246 (grand 
median +adjusted MAD), distal: —0.056 + 0.241; Pvalue determined by Wilcoxon 
signed-rank test for proximal versus distal medians, paired by mouse,n=9 mice. 


In recurrent networks connected ina feature-specific manner’, the 
effects of ablation on spared neurons should increase with the similarity 
of their tuning to the ablated population”>*. We tested this intuition 
in our model, defining the response similarity as the correlation of 
single neuron activity with the mean activity of the ablated neurons 
(Fig. 3a, Methods). In networks with increased subnetwork connectiv- 
ity, neurons with high response similarity showed the largest decline in 
encoding score after ablation (Fig. 3b). Innetworks without increased 
subnetwork connectivity, the relationship disappeared (Fig. 3b). 

We performed asimilar analysis on our experimental data. Because 
not all neurons were recorded simultaneously (Fig. 2c, Methods), it 
was not possible to compute a mean across ablated neurons or cor- 
relations between that mean and the activity of individual neurons. 
We therefore devised a response similarity metric that did not require 
simultaneous recording. For each neuron, we averaged pre-ablation 
responses across both trial types (Fig. 3c). Concatenating these yielded 
the trial-averaged AF/F of the neuron (Fig. 3d). Computing the mean 
trial-averaged AF/F across all ablated neurons provided the ablated 
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Fig. 3 | Effect of ablations depends on response similarity to ablated cells. 

a, Response similarity in simulated networks. Top, example neuron spike rate 
(black). Grey arrows denote sensory input. Orange denotes mean spike rate 
across ablated neurons. Bottom, response similarity was computed by 
correlating neuronal spike rate with the mean ablated neuron spike rate. 

b, Dependence of the change in encoding score after ablation on response 
similarity in model networks. Grey dots denote individual neurons. Dark grey 
lines denote single network averages. Pvalues for sign test that the slopes of 
linear fits across all networks (n= 30; Peony = 0.2 case: 3,779 neurons; Pron = 0.4 
case: 3,466 neurons) are O. Black line denotes grand mean across networks. 
Magentatriangle denotes neuron froma.c, Trial averaged response for 
example touch neuron. Heat maps show AF/F values for individual trials 
(proximal pole trials, left; distal, right). Bottom, trial-averaged AF/F values. 

d, Top, trial-averaged AF/F values for neuroninc. Bottom, the mean trial- 
averaged response across ablated neurons. Right, responsesimilarity is the 
correlation of individual neuron trial-averaged response vectors with the mean 
across ablated neurons. e, Changes induced by ablation of touch neurons onthe 
touch score asa function of response similarity in an example mouse. Blue circles 
denote individual neurons; blue line denotes binned mean (Methods). Slope fora 
linear fit of the points is given. f, Population data for all touch ablations for both 
touch (blue, left) and whisking (right, green) neurons. Thin coloured lines denote 
individual mouse mean values for agiven response similarity bin; thick lines 
denote cross-mouse mean values. Pvalues are from asign test in which the slope 
of change in encoding score asa function of response similarity is 0 (n=9 mice; 
2,768 touch neurons; 1,085 whisking neurons). g, Asinf, but for ablation of 
whisking neurons (n=7 mice; 1,692 touch neurons; 935 whisking neurons). 


neuron mean response. Response similarity was measured for each 
neuron as the correlation between its trial-averaged AF/F response 
and the mean response across all ablated neurons. 

Changes in R,oych Values after ablation depended on response simi- 
larity (Fig. 3e, f). R,ouch Values in neurons with high response similarity 
declined. Neurons with negative response similarity showed increased 
Rrouch Values, potentially owing to reduced feedback inhibition pre- 
viously evoked by the ablated neurons” (Fig. 3e). Ablation of touch 
neurons had no effect on the whisking network (Fig. 3f). Ablation of 
whisker neurons had no effect on either representation (Fig. 3g). 

Targeted photoablation allowed us to test the roles of recurrence in 
cortical circuits’. The selective degradation of representations similar 
to the ablated neurons is consistent with amplification in recurrent 
networks! 12.2326 (Fig. 1), but inconsistent with circuit models of 
sparse coding that are dominated by feedback inhibition” “”°”’ or 


models with all-or-none activity”. Our experiments reveal that corti- 
cal circuits can be surprisingly sensitive to damage targeting specific 
representations, despite remarkable robustness to other types of 
perturbation”’. 
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Methods 


Network model 

We modelled the L2/3 network associated with a single barrel column— 
one of the most extensively studied cortical microcircuits”’*°—as a 
network of leaky integrate-and-fire neurons”. The dynamics of each 
neuron were governed by: 


oa =V,- V(t) + RE) + H(t) + 1*(0)] (1) 


where Vis the membrane potential, V, is the rest/reset potential, 7 is 
the membrane time constant, R is the input resistance, /° and /"" are 
excitatory and inhibitory synaptic currents, /is a current represent- 
ing sensory stimulus drive (for example, from layer 4 inputs), and 
iindexes the neurons in the network. When the membrane potential 
reaches the spiking threshold, V,,, a spike is emitted, the membrane 
potential is reset to V,, and the dynamics of the neuron are frozen for 
a short refractory period, t,.;. The synaptic currents follow kick-and- 
decay dynamics: 


arp” 
‘nae 


= ~ 1+ Tn > w,O(t- di ty ) (2) 
jk 


where ‘syn’ denotes the type of synapse (either excitatory or inhibitor), 
Tn is the synaptic time constant, wis a matrix of synaptic strengths 
fromneuronjtoneuroni,¢, isthe time of the Ath spike of neuronj,and 
t,is the spike transmission delay. The sum over jis over all neurons, 
while the sum over kis over all spikes from that neuron. 

The network comprised 2,000 neurons, of which 1,700 (85%) were 
excitatory and 300 (15%) were inhibitory’®. Excitatory neurons had 
tT =30 ms, whereas inhibitory neurons had faster dynamics with 
T=10 ms", As the subthreshold dynamics for these neurons are lin- 
ear, the behaviour of these neurons is invariant to changes of scale in 
V. The meaningful quantity is AV= V,, — V,, which we assume to be, on 
average, 35 mV for all neurons“. Each neuron was assigned a value 
chosen uniformly within a range of +50% of this mean value. Excitatory 
and inhibitory post-synaptic currents had time constants of ,,.=2ms 
and T,,,= 3 ms, respectively. The refractory period for spiking was 
t,-¢= 0.5 ms. Synapses had a mean synaptic delay of t,; = 0.6 ms"; each 
individual synapse was assigned a value chosen within a range of +50% 
of this mean value. Although all neural parameters could be modelled as 
random variables, we found that jittering AV for the neurons and t, for 
the synapses provided sufficient heterogeneity to give a broad range 
of baseline spike rates across neurons and also prevented network 
synchronization. The input resistance, R, was factored into the synaptic 
weights and the magnitudes of the external currents, as described 
below. 

The excitatory population was subdivided into two groups: a small 
input-recipient subnetwork of 200 neurons and the remainder of the 
excitatory population (1,500 neurons). Although all neurons in L2/3 
probably receive touch-related input from L4*, only asmall proportion 
are drivento spike after touch’”, justifying this model assumption. We 
neglected feedforward inhibition®. This is justified because feedfor- 
ward inhibition simply rescales input from L4’, which in our model is 
adjusted to produce experimental observed population activity levels. 
Thus, each neuron belonged to one of three groups: the excitatory 
subnetwork (S), the remainder of the excitatory population (E), or the 
inhibitory population (1). Connections between neurons are deter- 
mined by ablock stochastic model. Giventwo neurons-the first from 
group A, the second from group B-there was a fixed probability of a 
connection from the first neuron to the second, denoted p,,. 

In our ‘equal subnetwork connectivity’ network, we used sparse con- 
nectivity between excitatory neurons, Dsc = Psp = Des = Deg = 0.2, and 
more dense connectivity both within the inhibitory population as well 


as between the excitatory and inhibitory populations, py =p\s5 = Die = 
Ds = Pp = 0.6 (refs. 5°). All connections had the same strength, which 
was chosen so that the resulting post-synaptic potential (PSP) is 1mV 
(refs. °°), Because it is the only connectivity parameter that was varied 
systematically, we denote Pg aS ‘Peony in the rest of the text. 

In the increased-subnetwork-connectivity version of the network, 
P.onn Was increased to 0.4. To replicate the experimentally observed 
relationship between connectivity and synaptic strength’, the synap- 
tic weight for these connections was increased to give PSPs of 1.6 mV. 
Networks with other levels of connectivity within the subnetwork were 
produced by linear interpolation or extrapolation of both P.,,,, and 
synaptic strength between the equal-subnetwork-connectivity and 
increased-subnetwork-connectivity cases. 

The stimulus drive to the network was modelled as an external cur- 
rent targeting the excitatory subnetwork (group S; f= 0 for all neurons 
in groups Eand 1). Foreach stimulus presentation, the waveform of the 
current was modelled with a beta distribution with shape parameters 
a=3and B=5. The beta distribution is defined on the interval [0, 1], 
giving a distinct beginning and end to the stimulus. For the chosen 
shape parameters, the beta distribution has a value of O atits end points 
anda peak at (a-1)/(a+ B - 2). To model the fast touch stimulus, the 
waveform was stretched in time so that the peak occurs 10 ms after 
the start (full-width half-maximum, 12.8 ms)'8. The amplitude of the 
stimulus waveform was chosen so that the network response matched 
the experimental data. All neurons also receive tonic background input 
inthe form ofa Poisson spike train of excitatory spikes with a frequency 
of 5,000 Hzfor excitatory neurons and 2,000 Hz for inhibitory neurons; 
these values were selected so that the tonic firing rate of these popula- 
tions were approximately 0.5 Hz for excitatory neurons and 10 Hz for 
inhibitory neurons* 

Simulations were performed in Python using the Brian2 simulation 
package” with a step-size of dt=0.1 ms. For each randomly sampled 
network connectivity, the network was first simulated for 20s of model 
time (corresponding to 66 stimulus presentations) and spike trains 
for all neurons were recorded. The activity of each neuron was then 
given an encoding score, which quantifies the signal-to-noise ratio 
of the representation of the stimulus: the spike train of the neuron 
was convolved with a Gaussian kernel (standard deviation, 20 ms) to 
produce a firing rate; the firing rate as well as the stimulus waveform 
was down-sampled by a factor of 5 (to a sample period of 0.5 ms) and 
the normalized cross-correlation between the signals was computed 
for leads/lags up to 10 ms; the peak value this cross-correlation is the 
encoding score. 

We first ran exploratory simulations to constrain the strength of 
the input to the subnetwork to match physiological data’. We simu- 
lated both the equal-subnetwork-connectivity (P.,,,, = 0.2) and the 
increased-subnetwork-connectivity (Poon, = 0.4) networks across a 
range onsensory input strengths. In both cases, we examined the dis- 
tributions of excitatory neuron encoding scores as a function of input 
strengths, selecting the input strength that produced a distribution 
most closely matching the experimental data (in terms of distribu- 
tion shape and fraction of neurons encoding the stimulus). With input 
strengths defined for these two cases, we then used linear interpola- 
tion or extrapolation to select the input strength for other amounts of 
subnetwork connectivity. Because this procedure ensured a fixed net- 
work output following the simulated sensory stimulus, amplification 
(ratio of network output to sensory input) (Fig. 1c) was defined as the 
inverse of the sensory input strength”. Amplification was normalized 
to the case where neurons within the input-recipient subnetwork had 
connectivity probabilities equal tothe non-input recipient excitatory 
population (that is, Poa, = 0.2). 

The final set of simulations explored the effects of targeted ablation 
across a range of subnetwork connectivity levels (Figs. 1, 3, Extended 
Data Fig. 1), number of ablations (Extended Data Fig. 4), and input 
kinetics (Extended Data Fig. 8). To simulate targeted ablation, we used 


30 randomized network instances for each subnetwork connectiv- 
ity level, number of ablations, and input rise time, and calculated the 
pre-ablation encoding score for each neuron. The 20 excitatory neu- 
rons with the highest encoding score were then removed (all outgo- 
ing synaptic strengths set to 0), the simulation was repeated, and the 
encoding scores after ablation were computed. Examination of the 
effects of the number of ablated neurons was done with the top 0,10, 
20,50 and 100 neurons (Extended Data Fig. 4). This was repeated for 30 
different realizations of the stochastic network connectivity. Neurons 
were considered to be part of the sensory input representation ifthey 
had an encoding score above 0.1; the effect of the ablation (Fig. le, f) 
was quantified as the change in encoding score across all neurons that 
met the 0.1 encoding score criteria either before and/or after ablations. 
The response similarity analysis (Fig. 3) used a more stringent encoding 
score cut off of 0.25; using a cut off of 0.1 did not alter the result, but 
reduced the magnitude of the observed effect. The ablated neurons 
were excluded for calculations of the distributions encoding scores 
before and after ablation, as well as in constructing PSTHs (Fig. 1b, e). 

In modelling the effect of the number of neurons ablated onthe 
change in encoding score (Extended Data Fig. 4), we restricted our 
modelling to the increased subnetwork connectivity case, Peony = 0.4. 
For simulations of ‘whisking’ input (Extended Data Fig. 8), we shifted 
froma ‘touch’-like stimulus, peaking 10 ms after stimulus onset, to one 
peaking 50 ms after onset to mimic the response to whisking input 
observed in vivo!”!®”235, We examined both the equal (P.,,,, = 0.2) and 
increased (P.,,.., = 0.4) connectivity cases. 

For large subnetwork connectivity (P..,,, > 0.4), the network transi- 
tioned to all-or-none behaviour, where strong enough input can drive 
the subnetwork into a state of persistent increased firing!’ (Extended 
Data Fig. 1). Toreset the network after sucha transition, we introduced 
astrong pulse of excitatory current to the inhibitory population300 ms 
after the stimulus onset. In this regime, the extrapolation scheme for 
setting the stimulus strength did not apply, because spike rate was 
dictated by network properties and not the input strength. Instead, the 
input strength affected the reliability with which a stimulus would cause 
a transition to the state of increased firing. We found that choosing a 
stimulus strength that was twice that dictated by our extrapolation 
scheme produced a reasonable number of stimulus-encoding neurons. 
Thus, we used this criterion to set the input strength in this regime. 

PSTHs were constructed by averaging the Gaussian-convolved (ker- 
nel standard deviation, 20 ms) responses of individual neurons, aligned 
to the sensory input. 


Determining model synaptic weights 
Model synapses were defined by kick-and-decay dynamics of the 
post-synaptic currents. We set the synaptic weights to produce a 
desired amplitude of the resulting unitary PSP. Here, we derive the 
relationship between the synaptic weight and the PSP size that allows 
us to do this. 

Assume that a synaptic current starts at /= 0 whena single spike of 
weight w arrives at t= 0. Inthe absence of any other spikes, the subse- 
quenttime-course of the current is found by integrating equation (2): 


I(t) = we™t/*syn (3) 


To determine the resulting behaviour of V(t), make the assumption 
that the PSP evolves according to a difference of exponentials: 


V(t)= Vole" -e" ) (4) 
Differentiating equation (4) and using equation (3), one can show 
that this form does indeed solve equation (3) if we choose: 


Rw 
Tin/Ts— 1 


T= Ty T= Tyr and Vo= 


(S) 


The PSP size isthe maximum value of V(t), which we can compute as: 


V, 


max — Yat * "(a > 1) (6) 
in which a =T,,/T, is the ratio of the time constants. Comparing this to 
the expression for V, in equation (6) we find our desired relationship: 


-4\71 
Rw=V,,a\* ) (7) 
The linearity of the synaptic dynamics allows us to use equation (7) 
to avoid explicitly determining the value of R. 


Mice 

All procedures were performed in compliance with the Janelia Research 
Campus Institutional Animal Care and Use Committee and the New 
York University University Animal Welfare Committee. Two transgenic 
lines were used for these experiments, differentiated in Extended Data 
Table 1 by a‘j’ or ‘n’ in mouse ID. Mice with IDs starting with letter ‘j’ con- 
sisted of mice expressing nuclear localized mCherry ina Cre-dependent 
manner (R26-LSL-H2B-mCherry’; JAX 023139) crossed with mice express- 
ing Creincortical pyramidal neurons (Emx1-IRES-cre*®; JAX 005628). In 
cortical L2/3, these mice expressed nuclear mCherry only in nuclei of 
excitatory neurons. Several (9 or 12) injections (450 tm deep, 300 pm 
apart; beveled pipettes, World Precision Instruments; 20 nl each, at10 nl 
min™ with a custom-built microinjector) of AAV2/1-syn-GCaMP6s 
(UPenn AV-1-PV2824)°? were made in barrel cortex (3.6 mm lateral, 
1.5 mm posterior) of young adult mice’ (6-8 week). After viral injec- 
tion, atitanium head bar was attached to the skull andthe craniotomy 
was covered with a cranial window. Craniotomies were always over 
the left hemisphere. Mice with IDs starting with ‘n’ consisted of mice 
expressing GCaMPé6sinaCre-dependent manner (Ai162"°, JAX 031562), 
crossed with Slc17a7-IRES2-cre*° (JAX 023527) to restrict expression to 
in pyramidal neurons. Surgeries for these mice did not include viral 
injections but were otherwise identical. With the exception of j258836, 
all mice were male. 


Behaviour 

Approximately one week after surgery, mice were trimmed to asingle 
row of whiskers (typically the C row) and placed on water restriction” 
(1ml per day). Training commenced 5-7 days after restriction onset. 
Mice were trained on an object-localization task” (Fig. 2a, b). If the 
pole appeared ina range of proximal positions, the mouse would be 
rewarded with a small water drop for licking the right lick port; pole 
presentation at the distal position would be rewarded upon licking the 
left lick port (Fig. 2a). Trials consisted of a1-1.2-s sample epoch followed 
by a0.5-1.2-s delay epoch after which a 50- or 100-ms 3.4 kHz auditory 
response cue signalled to the mouse to respond (Fig. 2b). To prevent 
premature licking, the lick port was brought into tongue range by a 
motor (Zaber) only during the response epoch (Fig. 2b). Mice exhibiting 
excessive premature licking (licks before reward cue on >20% of trials; 
licks monitored with a laser beam; Thorlabs) were not used. 

Mice were trimmed to a single whisker after reaching criterion per- 
formance (d’>1.5 for two consecutive days). The spared whisker barrel 
column was identified using the neuropil signal, as described previ- 
ously’. Whisker videography was performed at 400-500 Hz. Whiskers 
weretracked using an automated software pipeline“ and then curated 
using custom browser software’. 

Mice were assigned randomly to experimental groups (ablation 
type). Experimenters were not blinded to the group. 


Imaging 

Calcium imaging was performed using a custom two-photon micro- 
scope (http://openwiki.janelia.org/wiki/display/shareddesigns/ 
MIMMS) with a16x, 0.8 NA objective (Nikon)’. GCaMP (BG22; Chroma) 
and mCherry (675/70 filter; Chroma) fluorescence was imaged using 
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GaAsP PMTs (Hamamatsu). The 940 or 1,000 nm (Coherent) imaging 
beam was steered with a16 kHzline rate resonant galvanometer (Thor- 
labs); a piezo collar (Physik Instrumente) moved the focus axially. The 
512 x 512 pixel images were collected at 7 Hz, with three 600 x 600-~um 
images per piezo cycle. Planes were spaced 15 im apart. Scanimage* 
(Vidrio Technologies, http://www.vidriotech.com) controlled the 
microscope. Three planes, constituting a ‘subvolume’, were imaged 
simultaneously (4-6 subvolumes, spanning 45 ppm each, 12-18 total 
planes, 180-270 um in depth total), and power was modulated with 
depth using a length constant of 250 um. Deeper subvolumes were 
typically imaged with higher power, using a similar length constant. 
Individual subvolumes were typically imaged for SO trials per day, with 
all subvolumes usually visited on any given day. Alignment across days 
was performed as previously described”**. 

Imaging data were processed using a semi-automated software 
pipeline that included image registration, segmentation, neuropil 
subtraction, AF/F computation, and event detection’. The de-noised 
AF/F trace, which consisted of the event amplitude trace convolved 
with event-specific exponential rise and decay, was used for analysis. 


Neuronal classification 
Neurons were classified using a linear-nonlinear encoding model”. 
The model consisted of a cascaded generalized linear model that used 
a temporal and stimulus domain kernel to predict the activity of indi- 
vidual neurons given whisker angle, 6, or whisker curvature, K, assuming 
Gaussian noise with input nonlinearities. 

The model predicted the neuronal response, r (that is, AF/F), as 


r-Norm(z, 0) 


Z=f (S) XK +f, (Sy) xk 


Here, s, and s, are the whisker angle, 0, and whisker curvature, K, 
respectively. The terms f, and/f, are static, point-wise nonlinearities 
comprising a weighted sum of 16 triangular basis functions 


16 
f= wb) 
i=l 


in which xis the input (s, or s,), with each b; given by 


(X~ X-)/ 0% — Xa), OF 1 Hg < HSH 
By= 5 Gar 1 OG MES NX SKS Xia 
O, otherwise 


k,and k, are temporal kernels consisting of 14 time points (2s). 

Thus, the model gives a z-scored prediction of neural activity, z, by 
fitting parameters k,, k,, f,and f, for given whisker kinematic param- 
eters, s,ands,. 

The model parameters k,, k,,f, and f, were fit using maximum likeli- 
hood with block coordinate descent. This procedure reliably estimated 
model parameters within three to five iterations. 

To avoid degeneracy associated with arbitrary scaling factors 
on either the kernels or the nonlinearities, the nonlinearities were 
forced to have minimum of 0 and maximum of 1. Temporal kernels 
were unconstrained. 

Aprior was used to ensure smoothness of both the temporal kernels 
andthe nonlinearities and prevent over-fitting. Specifically, the prior 
added a factor to the objective function penalizing excessive second 
derivatives of the temporal kernels and nonlinearities. Employing 
such a prior corresponds to maximizing the log-posterior, with the 
prior adding a small penalty to the objective function. To fit several 
thousand cells efficiently, the scale factor associated with this penalty 
was determined from across-validated inspection of arandom subset 
of neurons. 


The model was fit using fivefold cross-validation across trials. That 
is,arandomly selected 80% of trials were used for fitting, and 20% were 
used for evaluation, with 5 distinct evaluation groups per fit ensuring 
all data was used for fitting in exactly one case. The Pearson correlation 
between the predicted and actual AF/F yielded a measure of the quality 
of the model fit. This correlation was used as the encoding score for 
barrel cortex data: R,ou,,, based on Ax for touch, and Ry risking based on 
whisker 6 for whisking (Fig. 2d, e). Aneuron was considered part of a 
representation if R,ouch OF whisking exceeded 0.1 and if the neuron score 
was above the 99th percentile of scores measured from matched shuf- 
fled neural activity. These criteria are more stringent than those used 
previously’, because ablation predominantly impacted neurons with 
high encoding scores. Using a less stringent criterion did not change 
the underlying conclusions, but did dilute the magnitude of the effect. 
For the response similarity analysis (Fig. 3),a threshold of 0.25 was used 
for the encoding score. 

Shuffled activity was generated by randomizing the timing of the 
calcium events to construct a novel de-noised AF/F trace. Matched 
shuffled activity was selected by using neurons fromthe same imaging 
subvolume (that is, concurrently imaged to ensure identical mouse 
behaviour) sharing a similar event rate. Event rates were matched by 
partitioning the neurons froma subvolume into 10 equally sized bins 
(in terms of neuron count). Thus, in addition to the aforementioned 
encoding score threshold, agiven neuron R value had to exceed the 99th 
percentile of R values obtained across neuronsinthe same subvolume 
and event rate bin whose event times were shuffled. 

We examined robustness by partitioning individual days into two 
interdigitated pseudo-sessions, and measuring the correlation between 
encoding scores for the two pseudo-sessions’. The resulting correla- 
tions ranged between approximately 0.5 and 0.75. Neural classifica- 
tion (whisking and touch) was stable across days for trained mice. 
Furthermore, the touch neuron curvature kernels assumed ‘V’- or ‘L’- 
like shapes’, meaning that high magnitude curvature changes drove 
the largest responses, a result consistent with the known responses 
of these neurons. Finally, the kernels were stable across days’. Thus, 
changes in barrel cortex encoding after ablation are not a reflection 
of the variability inherent to the encoding model, but rather reflect 
genuine changes in the underlying representations. 

Ablated neurons were always excluded from analysis of experimen- 
tal data, including calculations of pre-ablation population encoding 
scores. Analyses involving data before and after ablation were pooled 
across several behavioural sessions. Pre- and post-ablation data each 
consisted of at least two (but typically three) behavioural sessions, 
with each subvolume sampled on most sessions. Given that a single 
subvolume was imaged for around 50 trials in a session, classification 
typically used around 150 behavioural trials per neuron (minimum 
for encoding model: 100 trials). Neurons participating in both repre- 
sentations in a given area were excluded from analysis. In all cases in 
which comparisons of pre-and post-ablation distributions were made, 
neurons were included for analysis if they met the criteria for repre- 
sentation membership (described above) in the pre- or post-ablation 
period, or during both periods. 


Multiphoton ablation 

Ablations”“®” were performed with 880 nm (Chameleon Ultra 2; Coher- 
ent) or 1,040 nm (Fidelity HP; Coherent) femtosecond laser pulses 
delivered through a 0.8 NA, 16x objective (Nikon) in mice that were 
awake but not performing the task or lightly anesthetized using iso- 
flurane (1-2%). On the day of ablation, a new image was acquired and 
a warp field transform was used to find the target neurons**. Imaging 
was not performed onthe day of ablation; post-ablation analyses used 
the imaging data from the 2-4 behavioural sessions on the days after 
ablation. Typically, experiments consisted of three days of pre-ablation 
data collection, the ablation day, and three days of post-ablation data 
collection. 


In virally transfected mice, the mCherry signal was used to restrict 
ablations to pyramidal neurons’; in mice endogenously expressing 
GCaMpP, presence of fluorophore was used for this purpose. The 
neurons with strongest touch or whisking encoding scores within 
the spared barrel column were targeted for ablation. The percent- 
age of touch and whisking neurons ablated was calculated in rela- 
tion to the estimated 1,691 pyramidal neurons in L2/3", of which 17% 
(287) belonged to each representation’. Neurons participating in 
both representations (touch and whisking) were avoided. For silent 
cell ablation, neurons witha calcium event rate below 0.025 Hz were 
targeted for ablation. 

Inall cases, approximately 2/3 of ablations were successful (Extended 
Data Fig. 2c). Proximity to vasculature, low baseline fluorescence, and 
excessive depth accounted for most failures. Thus, despite targeting the 
strongest neurons ina given representation, the actual representation 
strength of ablated neurons varied. The encoding score percentile of 
the ablated neurons is therefore reported in the text. 

After ablation we consistently observed an increase in GCaMP6 
fluorescence inthe targeted neuron (Extended Data Fig. 2a, b). Tak- 
ing advantage of this signature, the ablation protocol consisted of 
interleaved ‘ablation’ and ‘evaluation’ epochs (ablation duration: 
50-200 ms; evaluation: 0.1-2 s, with longer evaluation times prov- 
ing more reliable) (Extended Data Fig. 2b). Ablation epoch power 
started at the evaluation epoch power (25-100 mW, measured at the 
specimen) and rose linearly as necessary (up to1 W) over the course 
of several seconds. The beam was focused on the brightest part 
of the targeted neuron using a pair of galvanometers (Cambridge 
Technology) and oscillated over a path spanning 1-2 pm. Evalua- 
tion epoch fluorescence data were collected using standard reso- 
nant galvanometer imaging’, although restricted to a single plane 
(approximately 30 Hz). Ablations were terminated after observation 
of the fluorescence rise in the target neuron. 

Consistent with similar protocols!°°°, ablation did not produce off- 
target damage: calcium event rates for neurons adjacent (10-25 um) 
to ablated cells did not change after the ablation of silent neurons 
(Extended Data Fig. 3a), and glial immunoreactivity was confined to 
the site of the lesioned neuron (Extended Data Fig. 3b-d). 

Typically, 10-50 ablations were performed over the course of one 
hour. For histological analysis (Extended Data Fig. 3b-d), perfusion 
was performed 24 h after ablation in two Emx1-IRES-cre x LSL-H2B- 
mCherry mice (9 and 28 ablations were successful in these mice, 
approximating typical experimental conditions). Alternating cry- 
omicrotome (Leica) sections were exposed to antibodies for either 
the microglial marker IBA1 (Abcam, ab5076) or the astrocytic marker 
GFAP (Abcam, ab7260). Glial reactions were measured by first locat- 
ing the centre of the ablated neuron in the glial immunoreaction 
image. An edge detection algorithm” that operated on an intensity 
image in angle-distance space from the neuron centre detected the 
extent of the reaction (Extended Data Fig. 3d). Specifically, intensity 
profiles were measured across a range of angles emanating from the 
point within the ablation. To delimit the glial reaction, large dropsin 
intensity were detected. Glia was considered reactive if the intensity 
inside the detected reaction area was two standard deviations above 
background image intensity. 

Following ablation, nearby neurons retained sensory responses 
(Fig. 2f), event rate (Extended Data Fig. 3a), and maintained struc- 
tural integrity (Extended Data Fig. 3b, c, e). Ina few instances (n=5) 
the ablation termination protocol failed, resulting in more extensive 
lesions (Extended Data Fig. 3f). These experiments were excluded 
from the study. 

We excluded ablated neurons as well as neurons within a cylinder 
centred on ablated neurons having a radius of 10 pm and a height of 
60 um from all analyses. This excluded neurons abutting ablated cells, 
and ensured no neurons within a typical glial reaction radius would 
be included. 


Response similarity 

For simulated data, response similarity was measured by taking the 
Pearson correlation of the Gaussian-convolved (kernel standard devia- 
tion, 20 ms) activity of an individual neuron with the mean Gaussian- 
convolved activity of the ablated neurons (Fig. 3a). For experimental 
data, response similarity was measured by correlating the individual 
neuronal trial averaged AF/Fto the ablated neuron trial average mean. 
For eachneuron, trial averaged AF/F was calculated by taking the mean 
AF/Facross all correct proximal and distal pole trials (Fig.3c), then con- 
catenating these two vectors (Fig. 3d). The mean of these vectors across 
the ablated neurons constituted the ablated neuron mean (Fig. 3d). 
Response similarity is simply the Pearson correlation of the individual 
neuron trial averaged AF/F with the mean trial averaged AF/F across all 
ablated neurons. Single-network (Fig. 3b) or mouse (Fig. 3e-g) averages 
werecomputed with response similarity bins having a width of 0.1. The 
grand mean of these is shownas a dark line on these plots. Only strongly 
responding neurons (encoding score > 0.25) were considered in this 
analysis (other analyses use a cut-off value of 0.1). 

Trial-averaged AF/F correlations were used instead of raw activity 
correlations because neurons were not all imaged simultaneously; only 
neurons ina given subvolume were imaged simultaneously. Because 
ablated neurons came from multiple subvolumes, response similarity 
had to use trial-averaged responses to allow for comparison across 
disjointly recorded populations. 


Statistical analyses 

Most statistical comparisons were performed using the Wilcoxon 
signed-rank test comparing paired medians within individual mice 
for two conditions (for example, before and after ablation, or proximal 
and distal encoding score change). For cases in which values had no 
natural pairing (comparison of different ablation types), the Wilcoxon 
rank-sum test comparing medians was used to compare distributions. 
To test whether encoding score change depended on response similar- 
ity (Fig. 3), wefirst fit a line to individual networks or mice (for example, 
Fig. 3e; the linear fit is distinct from the cross-cell mean that is shown). 
Next, wetested whether the slopes thus obtained were distinct from O 
across all networks or mice using the non-parametric sign test. 

In all cases, we used the median of single-neuron values within a 
mouse. That is, we treated mice, and never neurons, as independent 
observations. Where relevant, the total number of neurons included 
across all mice was given. Where given, adjusted MAD was calculated by 
multiplying the median absolute deviation by 1.4826 so as to approxi- 
mate the standard deviation under conditions of normality. Sample 
sizes were similar to those used by others in the field. No statistical 
tests were used to determine sample sizes. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Data can be found at CRCNS (http://crcns.org/) at https://doi. 
org/10.6080/KOZ31WWG. 


Code availability 

Code for the simulations can be found at https://github.com/jwitten- 
bach/ablation-sim. Code used for data analysis can be found at https:// 
github.com/peronlab/ablation. 
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Extended Data Fig. 1| Effect of ablation in a simulated network with hyper- 
connectivity (that is, P..,,,, > 0.4; here, Pon, =0.44). Model network responses 
aligned to input (arrow, bottom). Raster plots showa subset of neurons from an 
example network. PSTHs show averages across all neurons and networks. 
Bottom, excitatory neurons within subnetwork. Middle, excitatory neurons 
outside subnetwork. Top, inhibitory neurons. Left, network response before 
ablation of 10% of the subnetwork neurons; right, response after ablation. 
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Extended Data Fig. 2 | Multi-photon ablation protocol. a, Example field of depth for all experiments includedin this dataset. Individual points give mean 
view immediately before (left) and after (right) ablation. Orange arrowdenotes __ successrate for given depth bin; bin size, 25 um.d, Depth dependence of total 
target neuron.b, Ablation protocol. Power (orange, top trace) during ablation energy deposition for successful ablations (successfully ablated cells only:n 
epochs (100 ms; increased power, orange, top) gradually increased, andthe = 293 cells across 22 sessions, 14 mice; ablations from j250220 and j257218, 
PMT shutter was closed (black bars). During the intervening evaluationepochs, alongwith 7 additional ablations from other mice were excluded owing to 
power was lower and constant (orange, top), and the PMT shutter was open. incomplete logging). Grey dots denote individual ablations. Black dots, means 
Ablation was terminated when GCaMP fluorescence at the target neuron across 25-y1m bins. e, Asind, but for peak power needed for ablation 


(green) jumped (orange arrow).c, Success of ablation as a function of neuron (n=293 cells across 22 sessions, 14 mice). 
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Extended Data Fig. 3 | Multi-photon ablation produces spatially localized 
effects. a, Change in calcium event rate (Methods) as a function of minimal 
distance to an ablated neuron after silent cell ablation. Individual neurons 
appear as grey points, with dark grey dashed lines showing single mouse 
averages and the dark black line showing the cross-mouse (n=8 silent ablation 
mice) average. Event rates among neurons adjacent to ablated silent cells 

did not change (event rate before ablation: 0.014 + 0.008 Hz, grand 

median + adjusted MAD; after ablation: 0.014 + 0.007 Hz; P=0.055 before 
versus after ablation, Wilcoxon signed-rank test, paired medians acrossn=8 
mice; 1,028 neurons across all mice) (Methods). b, Confocal ex vivo image 
froma mouse perfused 24h after ablation. Ablation sites are indicated 

with dashed white circles. Green, GCaMP6s fluorescence; red, mCherry 
fluorescence; blue, microglial antibody IBA1 fluorescence. c, Asinb, but blue 
showsimmunoreactivity for the astrocytic marker GFAP. d, The spatial extent 
of glial reaction was measured by detecting the fastest intensity decline ridge 


(dashed white line) in the glial antibody image along lines emanating from 

the ablation centre at varied angles (Methods). Top, ridge along ablation from 
b. Bottom left, intensity image in angle-distance space within which the ridge 
was measured. Bottom right, distribution of reaction radii; all points constitute 
IBA1 labelling, as no detectable glial reactions were observed with GFAP: 

8 out of 1OIBA1-labelled and 0 out of 6 GFAP-labelled ablations retrieved 
histologically revealed a detectable glial reaction (Methods). These reactions 
had radiiof11.7 + 1.7 1m (mean ts.d.;n=8 sites).e, Two-photon in vivo images 
before (left) and after (right) asuccessful ablation (target, white dotted line). 
Green, GCaMPé6s fluorescence; red, mCherry fluorescence. f, Asine, butaftera 
failure of the ablation protocol toterminate the ablation. Excess energy 
deposition produceda large lesion (black, centre of image). White arrows 
denote corresponding points in the twoimages. Mice (n=5) withsuch lesions 
were excluded from thestudy. 
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Extended Data Fig. 4| Effect of ablation on L2/3 model sensory 
representation increases with the number of ablated neurons. Ablation 
effect (change in Rgimutus) aS a Function of the degree of touch representation 
degradation (net Retimurus ACrOSS ablated neurons). In all cases, we used increased 
subnetwork connectivity (0.4) (Fig. 1, Methods). The napiacea Neurons with the 
highest encoding score were selected for ablation. Grey circles denote 
individual networks. Black dots denote, median across n=30 simulated 
networks fora given number of ablated neurons, indicated in the plot. Beyond 
Navlatea > 50, we Observed instability, presumably because we did not attempt to 
restore excitatory-inhibitory balance after ablation; these data were omitted. 
Correlation of net Remus ADlated and AR gtimutus: R=—0.65, P< 0.01. 
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Extended Data Fig. 5 | Ablation of touch neurons does not produce distance- 
dependent effects in the whisking representation. Proximal (15-35 umto 
nearest ablated cell) change in Pyrisking: 0.000 + 0.192 (grand median + adjusted 
MAD); distal (115-135 pm): -0.023 + 0.186. Pvalue given for Wilcoxon signed- 
rank test comparing proximal paired to distal (n=9 mice). Legend asin Fig. 2m. 
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Extended Data Fig. 7 | Example effects of ablations of whisking and silent is indicated by a faint orange background. Centre, Ryouch (COP) ANd Rynisking 
neurons. a, Example ablation of a whisking neuron. Left, example maps for (bottom) values for the ablated population. Right, Rroucn (top) and Rynisking 
touch (top, blue) and whisking (bottom, green) cells before ablation. Sphere (bottom) values after ablation, with ablated neurons again excluded. b, Asin 
size correspondsto Ryouch (tOp) OF Rynisking (bottom) values. Grey dots denote a, but for ablation of silent neurons. 
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increasing subnetwork connectivity. Bottom totop, raster plots (eachshowing network (n=30 networks). 


Extended Data Table 1 | Individual mice are listed with the number of neurons imaged, the type(s) of ablation(s) performed, 
and the number of neurons ablated 


AnimalID_ Cell Ablated type Ablated 
count count Touch AR touch Whisking AR whisking 
count count 
j250220 8,844 touch 50 867 -0.122 422 -0.076 
j257218 6,215 silent 20 743 0.018 366 0.032 
j257220 3,663 touch 9 389 -0.004 455 0.025 
j258836 7,290 silent 16 481 0.066 666 -0.012 
touch 1S 488 -0.055 655 0.001 
j271211 9.415 silent 16 2,476 -0.012 551 -0.007 
touch 15 1,951 -0.061 692 0.021 
j278937 4,230 silent 15 146 0.004 379 0.001 
whisking 12, 150 -0.006 392 0.011 
j278939 7,167 silent 20 636 -0.024 698 0.001 
whisking 13 523 -0.017 649 -0.015 
j281915 7,002 whisking 8 723 -0.010 550 0.004 
touch 11 601 -0.022 672 0.008 
n274424 7,785 touch 17 706 -0.029 718 -0.014 
n272761 6,359 touch 14 509 -0.001 410 0.000 
n274577 12,173 touch 11 1,380 -0.013 657 0.004 
n275801 8,354 silent 14 220 0.000 738 -0.007 
touch 9 197 -0.065 698 -0.006 
n275798 11,364 silent 13 959 0.002 958 0.002 
whisking 11 972 0.008 1,037 0.011 
n278288 9,314 whisking 25 969 -0.008 813 -0.006 
n276013 11,659 silent 16 652 0.028 1,072 -0.001 
whisking 9 634 -0.012 968 -0.008 


n278759 9,187 whisking 11 617 -0.003 526 -0.004 


The number of touch and whisking neurons is given (for the 0.1 encoding score criteria) (Methods), as well as the change in median encoding score after ablation. Mice with IDs starting with a ‘j’ 
were virally transfected Emx1-cre x LSL-H2B-mCherry’; mice with IDs starting with an ‘n’ were Ai162 x Slc17a7-IRES-cre“°. 
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The production of pore-forming toxins that disrupt the plasma membrane of host 
cells isacommon virulence strategy for bacterial pathogens suchas methicillin- 
resistant Staphylococcus aureus (MRSA)*®. It is unclear, however, whether host 


species possess innate immune mechanisms that can neutralize pore-forming toxins 
during infection. We previously showed that the autophagy protein ATGI6L1is 
necessary for protection against MRSA strains encoding a-toxin*—a pore-forming 
toxin that binds the metalloprotease ADAM10 on the surface of a broad range of target 
cells and tissues***. Autophagy typically involves the targeting of cytosolic material to 
the lysosome for degradation. Here we demonstrate that ATGI6L1 and other ATG 
proteins mediate protection against a-toxin through the release of ADAM10 on 
exosomes—extracellular vesicles of endosomal origin. Bacterial DNA and CpG DNA 
induce the secretion of ADAMI10-bearing exosomes from human cells as well as in 
mice. Transferred exosomes protect host cells in vitro by serving as scavengers that 
can bind multiple toxins, and improve the survival of mice infected with MRSA in vivo. 
These findings indicate that ATG proteins mediate a previously unknown form of 
defence in response to infection, facilitating the release of exosomes that serve as 
decoys for bacterially produced toxins. 


We previously demonstrated that primary cells obtained from mice 
with hypomorphic expression of Atg16I1 (Atg16l1) display anincrease 
in total ADAMI1O levels and are susceptible to lysis when cultured in 
the presence of a-toxin*. Consistent with these findings, we have 
now found that levels of cell-surface and total ADAM10 are increased 
in the human alveolar epithelial cell line A549 upon short hairpin 
(sh)RNA-mediated depletion of ATGI6L1 (Fig. la—d). ATGI6L1-knock- 
down cells treated with purified a-toxin displayed increased cell death 
compared with control cells transduced with nontargeting shRNA, 
whereas ADAMI0-knockdown cells were resistant (Fig. le). ATG16L1 
mediates the conjugation of phosphatidylethanolamine to the ubiqui- 
tin-like molecule LC3—a step that is necessary for the proper biogenesis 
of the autophagosome and for subsequent events in which substrates 
are degraded by the lysosome’. Inhibiting ULK1, a kinase upstream of 
ATGI6LI, or ATGS, a binding partner of ATG16L1, led to increased cell- 
surface ADAMI1O levels similar to those produced by knocking down 
ATGI6L1 (Fig. 1f). Prevention of lysosomal acidification by weak bases 
alters endosomal recycling tothe plasma membrane®”’. Although total 
levels of ADAM10 and the autophagy substrate SQSTM1 were increased 
when A549 cells were treated with lysosomal acidification inhibitors 
(NH, CI, chloroquine or bafilomycin), all three agents decreased sur- 
face ADAM10 levels (Fig. 1g, h and Extended Data Fig. la-d). Surface 
levels of epithelial cell adhesion molecule (EDCAM) were unaltered, 
indicating that lysosome inhibition did not affect all plasma-membrane 


molecules (Extended Data Fig. le-g). ADAMI10 levels were unaffected 
by proteasome inhibition (Extended Data Fig. 1h, i), suggesting that 
ATG proteins reduce cell-surface ADAM10 through a lysosome- and 
proteasome-independent process. 

ATG proteins mediate the extracellular release of soluble and vesicle- 
bound substrates through a process broadly referred to as secretory 
autophagy’®. ADAM1O is known to be incorporated into exosomes— 
extracellular vesicles typically 40-120 nm in diameter". Thus, we 
hypothesized that the autophagy machinery prevents ADAMI1O accu- 
mulation on cells by facilitating its secretion on exosomes. We found 
a reduction inthe lower-molecular-weight band of ADAMI10 (amature 
form cleaved during trafficking from the endoplasmic reticulum) in 
exosome fractions isolated from the culture supernatants of ATGI6L1- 
knockdown cells compared with control cells treated with nontargeting 
shRNA (Fig. 2a, b and Extended Data Fig. 2a). Western blot analysis 
confirmed that the fractionation procedure led to enrichment of the 
exosomal marker CD9 and not the microvesicle marker ARF6 (Extended 
Data Fig. 2b). Parallel analysis by transmission electron microscopy 
(TEM) indicated that the exosome fraction contained a greater number 
of single-lipid-bilayer vesicles of 80-150 nm in diameter compared with 
microvesicles larger than 150 nm (Extended Data Fig. 2c-e). 

The decrease in ADAM10 that occurs after ATG16L1 inhibition reflects 
a general reduction in exosome levels: we observed a reduction in CD9 
levels by western blot anda reduction inthe number of vesicles by TEM 
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Fig. 1| ATG16L1inhibits surface ADAM10 independently of lysosomal 
degradation. a, b, Representative flow-cytometry histogram (a) and 
quantification of mean fluorescent intensity (MFI) (b) of surface ADAM1O in 
A459 cells following ATGI6L1 knockdown (ATGI6L1KD; n=5);or incells 
containing nontargeting control shRNA (nt shRNA; n=10).c,d, Representative 
western blot (c) and quantification (d) of ADAM10 in ATG16L1 KD and control 
cells;n=3.a.u., arbitrary units. e, Quantification of cell death (assayed by 
release of lactate dehydrogenase, LDH) of nt shRNA, ATGI16L1 KD and ADAM10 
KD cells following treatment with purified a-toxin; n=4. f, Quantification of 
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Fig.2| ATG proteins regulate the release of ADAM10-containing exosomes. 
a-c, Representative ADAM10 and CD9 western blot from three independently 
repeated experiments (a); quantification of exosome ADAM10,n=3 (b); and 
quantification of CD9 incelllysates and exosomes from nt shRNA (n=7) and 
ATGI6L1KD (n=3) cells (c).d, e, Representative transmission electron 
micrographs (d) and quantification (e) of vesicles inthe exosome fraction of nt 
shRNA and ATGI6L1KD culture supernatants. Scale bars, 100 um;n=80 
images. Ctrl, control. f, Flow-cytometric quantification of exosomes from 
untreated (n=4), ntshRNA (n=3), ATGI6L1 KD (n=7), ULK1KD (n=6) and ATG7 
KD (n=6) A549 cells. g, Quantification of ADAM10 MFlinuntreated, nt shRNA 
and ATGI6L1KD exosomes from f. n=3.h, Exosome quantification (CD9, 
CD63", CD81° and PKH67‘ structures) in blood from C57BL/6) (wild-type (WT); 


surface ADAM1O by flowcytometry innt shRNA (n=3), ATGS KD (n=3) and ULK1 
KD (n=4) A549 cells. g,h, Representative flow-cytometry histogram from 
three independent repeats of surface ADAM10 0n A549 cells 24h after 
treatment with bafilomycin (BAF; 10 nM) (g), and quantification of MFl over 
time following addition of BAF (h; n=3). PBS, phosphate-buffered saline. 
Measurements were taken from distinct samples, and graphs show means and 
standard errors of the mean (s.e.m.).b, d, f,h, Two-tailed, unpaired t-test with 
Welch’s correctioncompared with nt shRNA controls. e, Two-tailed, unpaired 
t-test of area under curve compared with ntshRNA controls. 
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n=6) and ATGI16L1 hypomorph (HM; n=8) mice. i, Exosome quantification 
following addition of PBS (n=4), chloroquine (CQ; n=5) or BAF (n=9). 

j, Representative western blot from three independent repeats analysing 
ADAMI10, SQSTM1and LC3ll levels in nt shRNA and STX17 KD cells. k, ADAM10 
MFlof nt shRNA (n=5) and STX17 KD (n=6) cells. 1, Exosome quantification 
fromnt shRNA and STX17 KD cells;n=8. Measurements were taken from 
distinct samples and graphs show means +¢ s.e.m.b,c,e,h,I, Two-tailed, 
unpaired ¢-test with Welch’s correction compared with nt shRNA or WT 
controls. f, g, i, One-way analysis of variance (ANOVA) with Dunnet’s post-test 
compared with nt shRNA or PBS. Data represent at least three independent 
experiments. 
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inthe exosome fraction of ATGI6L1-knockdown cell-culture superna- 
tant (Fig. 2a, c—-e). To further validate these results through a quantita- 
tive assay, we used flowcytometry in which antibody-based staining of 
the surface exosome markers CD9, CD63 and CD81was combined with 
PKH67, a fluorescent lipid-bilayer-intercalating compound (Extended 
Data Fig. 2f). Depletion of autophagy proteins substantially reduced the 
total numbers of exosomes in the culture supernatant (Fig. 2f). ATGI6L1- 
knockdown reduced the total number of ADAM10-positive exosomes 
but not the amount of ADAMI10 per exosome (Fig. 2g), confirming that 
the ATG proteins regulate exosome biogenesis rather than substrate 
incorporation. We also found that the blood from Atg16l1™ mice con- 
tained fewer exosomes than blood from wild-type mice (Fig. 2h). 

Our finding that blocking lysosomal acidification decreases plasma- 
membrane ADAM1O levels could be explained bya mechanism in which 
inhibiting late-stage autophagy redirects the autophagy machinery 
towards generation of exosomes” *. Consistent with this possibil- 
ity, we detected increases in CD9 and ADAMI1O levels in the exosome 
fraction as well as anincrease in total exosome numbers inthe culture 
supernatant of cells treated with chloroquine or bafilomycin (Fig. 2iand 
Extended Data Fig. 3a—d). The SNARE protein syntaxin 17 (STX17) medi- 
ates autophagosome-lysosome fusion and is dispensable for secretory 
autophagy!*””. STX17 knockdown increased total ADAM10, SQSTM1and 
LC3ll levels, indicating successful inhibition of autophagy, without 
increasing surface levels of ADAM1O (Fig. 2j, k). However, supernatants 
from STX17-knockdown cells contained more exosomes (Fig. 21), indi- 
cating that ATG proteins mediate the release of exosomes ina manner 
distinct from conventional degradative autophagy. 

We next examined whether ATG-dependent exosome production 
is induced by pathogen exposure. Heat-killed S. aureus (CA-MRSA 
USA300, hereafter HKSA), an isogenic a-toxin-deficient USA300 
strain (Ahla), Streptococcus pneumoniae, Citrobacter rodentium and 
Salmonella enterica Typhimurium all increased exosome production 
inhuman and mouse cells (Fig. 3a and Extended Data Fig. 4a, i). After 
testing several bacterially derived products, we indentified bacterial 
DNA and CpG DNAas the exosome inducer (Fig. 3b and Extended Data 
Fig. 4b-g). Furthermore, addition of DNA isolated from S. aureus to 
cells elicited exosomes, and DNase treatment abolished this effect 
(Extended Data Fig. 4j). Exosome production in response to HKSA and 
CpG DNA depended on the endosomal DNA-sensor Toll-like receptor 
9 (TLR9Y) (Fig. 3c and Extended Data Fig. 4h). Inducing autophagy with 
Torin-1—an inhibitor of mammalian target of rapamycin (mTOR)—did 
not induce exosomes, suggesting that TLR9 acts through a distinct 
mechanism (Extended Data Fig. 4k, |). Instead, the addition of CpG DNA 
or bafilomycin (a positive control) individually or together decreased 
LysoSensor staining, an indicator of acidic organelles (Extended Data 
Fig. 5a—c). We also found that treating cells with the neutral sphingomy- 
elinase inhibitor GW4869—which prevents the generation of vesicles 
that become exosomes by interfering with the inward budding of the 
multivesicular body (MVB)'8—impairs CpG-DNA-induced exosome 
production (Extended Data Fig. 4m). Thus, the membrane-trafficking 
events downstream of TLR9 probably contribute to exosome produc- 
tion by regulating endosomal trafficking and vesicle-biogenesis events 
that include the MVB. 

Intravenous injection of heat-killed or live S. aureus into wild-type 
mice led toamarked increase inthe number of exosomes intheir blood 
that was blunted in Atg16/1 mice, but not in mice in which Atg16l1 was 
selectively deleted in macrophage and dendritic-cell lineages (Fig. 3d, 
eand Extended Data Fig. 4n, 0). This observation is consistent with our 
previous study in which Atg16l1™ mice, but not myeloid-cell-specific 
Atgi16l1 knockout mice, were susceptible to lethal bloodstream infection 
by MRSA‘. Next, we performed mass spectrometry on exosomes from the 
blood of mice inoculated intranasally with HKSA or CpG DNA (Extended 
Data Fig.4p). The majority of detected proteins originated from the liver 
and were previously identified in exosomes and extracellular spaces 
(Fig. 3f-h and Supplementary Tables 1-3). We confirmed that the liver 
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Fig. 3 | Bacteriainduce exosome production. a-e, Flow-cytometric 
quantification of exosomes in A549 cell-culture supernatant 18 h after 
exposure to heat-killed S. aureus (n=7), heat-killed S. pneumoniae (n=5), heat- 
killed C. rodentium (n=4), heat-killed S. Typhimurium (n=3) (a); after CpG DNA 
treatment (4 1M; n=5) (b); innt shRNA (n=6) and TLR9 shRNA (TLR9 KD; n=3) 
targeted A549 cells following HKSA exposure (c); in blood from wild-type and 
Atg1611™ mice following intranasal (i.n.) inoculation with HKSA (1 x 10° colony- 
forming units (CFU); WT plus PBS, n=7; WT plus HKSA, n=9;HM plus PBS, n=2; 
HM plus HKSA, n=4) (d); or following intravenous (i.v.) inoculation with live S. 
aureus (1x 10’ CFU; WT plus PBS, n=5; WT plus HKSA, n=10; HM plus PBS, n=3; 
HM plus HKSA, n=6) (e).f, Venn diagram of shared and discreet proteins 
identified by mass spectrometry in exosomes isolated from the blood of mice 
exposed to HKSA or CpG DNAi.n. (1x 10° CFU; 20 pg CpG DNA). g, Gene- 
ontology analysis of the subcellular location of proteins identified by mass 
spectrometry. h, Tissue-specific origin of exosome proteins. Measurements 
were taken from distinct samples and graphs show means +s.e.m.a, b, Two- 
tailed, unpaired t-test with Welch’s correction compared with PBS controls. 
c-e, One-way ANOVA with Dunnet’s post-test compared with nt shRNA plus 
PBS, or WT plus PBS controls. 


enzyme argininosuccinate synthase 1 (ASS1) was enriched in HKSA, and 
that CpG DNA elicited exosomes in vivo” (Extended Data Fig. 4p). 
Next, we tested whether these released vesicles could serve as a host 
response to bind and inhibit toxins. We found that exosomes isolated 
from control donor cells, but not fromA7G16L1-knockdown cells, were 
able to protect A549 target cells from a-toxin toxicity (Fig. 4a). Add- 
ing twice the volume of the supernatant of ATGI6L1-knockdown cells 
fromwhich exosomes were isolatedimproved the viability of the cells, 
indicating that the inability of exosomes from ATGI6L1-knockdown 
cells to protect cells was due to areduction inthe number of exosomes. 
Exosomes harvested from ADAM10-knockdown cells were unable to 
protect cells (Fig. 4a and Extended Data Fig. 6a). These results were 
confirmed with exosomes purified through fluorescence-activated 
cell sorting (FACS; Fig. 4b). Of note, preincubating cells with HKSA 
or CpG DNA also protected against a-toxin toxicity (Extended Data 
Fig. 6b). This protection was due to exosomes: removing the exosome- 
containing supernatant restored susceptibility to a-toxin in HKSA 
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Fig. 4| Exosomes protect against bacterial toxins. a, b, A549 cell death 
following treatment with a-toxin together with exosomes isolated from nt 
shRNA (n=6), ATGI6L1 KD (n= 6), ATGI6L1 KD x 2 (n=3), ADAM10 KD (n=5) cells 
(a); or with FACS-purified exosomes (n=4) (b).c, d, Representative western 
blot (c) and quantification (d) of oligomerized o-toxin larger than 130 kDa 
following addition of exosomes isolated from WT or ADAM10 KD cells. n=3. 

e, BMDM death following treatment with LukED and exosomes isolated from 
WT BMDMcultures (LukED only, n=10; LukED plus WT exosomes, n= 16). 

f, A549 cell death following exposure to diphtheria toxin (DPT) and exosomes 
isolated from A549 cultures. n=12.g, Survival of WT mice infected i.v. with 

S. aureus(USA300; 5 x10’ CFU) mock-treated or injected intraperitoneally with 
exosomes from WT mice. n=9 mice per condition. h, Survival of WT (n=10) and 


or CpG-DNA-treated cells (Extended Data Fig. 6b). Exosomes seem 
to protect cells by inducing toxin oligomerization on the exosome 
membranes (Fig. 4c, d). 

Exosomes elicited from mouse bone-marrow-derived macrophages 
(BMDMs) harbour the toxin receptor CCR5Sand protected BMDMs from 
LukED (Fig. 4e and Extended Data Fig. 6c), another toxin produced by 
S. aureus*®. Similarly, exosomes isolated from A549 cells protected 
target cells from diphtheria toxin (Fig. 4f), a potent toxin produced 
by Corynebacterium diphtheriae that binds to the epidermal growth 
factor receptor (EGFR), which was present in our exosome proteom- 
ics dataset (Supplementary Table 1). Thus, exosomes can neutralize 
different types of toxin. 

To test whether exosomes are protective in vivo, we injected donor 
mice with HKSA to elicit exosomes in the blood; we then transferred 
these exosomes into recipient mice and infected the animals intra- 
venously with a lethal dose of S. aureus. Transfer of exosomes from 


Atg1611 (Mock to HM, n=10; WT exosomes to HM, n=10) miceinfected iv. 
with2.5 x10’ CFU of S. aureusand receiving exosomes from WT mice. NS, not 
significant. i, Survival of WT mice (n=10) pretreated with intranasal HKSA 
followed bya lethal dose of S. aureus (strain USA300; 5 x 10’ CFU; n=10) oran 
isogenic a-toxin-deficient strain (Ahla;n=5).j, S. aureusburden 24 hafter 
infection with1 x 10’ CFU of USA300 iv. in kidney, spleen, lung and blood (per 
millilitre) in mice pretreated with PBS or HKSAi.v.;n=6. Measurements 

were taken from distinct samples. LOD, limit of detection. Graphs show 
means +s.e.m.a,b, One-way ANOVA with Dunnet’s post-test compared with 
a-toxin only or control exosomes. d-f,j, Two-tailed, unpaired t-test with 
Welch’s correction compared with nt shRNA exosomes, a-toxin only or PBS 
controls. g-i, log-rank Mantel-Cox test. 


wild-type but not Atg16/1 donors extended the survival of S. aureus- 
infected wild-type recipient mice (Fig. 4g and Extended Data Fig. 6d, e). 
Furthermore, transfer of exosomes from a wild-type donor improved 
the survival of Atg16l1™ mice injected with lethal dose 50 of S. aureus 
to levels similar to those of mock-treated wild-type mice (Fig. 4h), sug- 
gesting that the increased susceptibility of Atg/6/1 mutants is in part 
due to reduced exosome production. 

Finally, priming mice with intravenous injection of HKSA increased the 
relative amount of a-toxin oligomers to monomers in the exosome frac- 
tionisolated fromthe bronchoalveolar lavage (BAL) fluid compared with 
mock-treated controls (Extended Data Fig. 6f-k). Additionally, wefound 
that conditioning the mice with HKSA prolonged survival following S. 
aureus infection, phenocopying control mice infected with the Ahla 
strain (Fig. 4i). To monitor bacterial burden, we challenged mice with 
alower inoculum, and found that conditioning with HKSA resulted in 
reduction in S. aureus burdens in the kidneys and blood (Fig. 4)). 
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Exosomes extend the functional reach of their cells of origin and 
contribute to a spectrum of biological processes”*”’. Our results sug- 
gest that exosomes can also provide a previously undescribed innate 
immune response to bacterial infection by acting as decoys that trap 
membrane-acting virulence factors, such as pore-forming toxins, to 
prevent injury of target tissues. The action of these ‘defensosomes is 
in line with recent evidence that engineered liposomes can neutral- 
ize toxins from Gram-positive bacteria’. Our findings also indicate 
that ATG proteins regulate the production of exosomes during host 
defence—a mechanism that is distinct from the recently uncovered 
roles of ATGI6L1 in promoting plasma-membrane repair during Lis- 
teria monocytogenes infection, or inpreventing necroptosis mediated 
by the pore-forming molecule MLKL during norovirus infection”>”°. 
Rather, our findings resemble recently described processes in which 
extracellular vesicles are produced when the atypical ATG12—ATG3. 
conjugate promotes MVB function, or when ATGS disrupts lysosomal 
acidification”’”*. Given that the origin and regulation of extracellular 
vesicles remain poorly defined, a detailed understanding of the cellular 
response triggering the production of defensosomes during infection 
may reveal opportunities to leverage their unique properties to combat 
bacteria and other pathogens. 


Online content 


Any methods, additional references, Nature Research reporting sum- 
maries, source data, extended data, supplementary information, 
acknowledgements, peer review information; details of author con- 
tributions and competing interests; and statements of data and code 
availability are available at https://doi.org/10.1038/s41586-020-2066-6. 


1. Dal Peraro, M. & van der Goot, F. G. Pore-forming toxins: ancient, but never really out of 
fashion. Nat. Rev. Microbiol. 14, 77-92 (2016). 

2. — Seilie, E. S. & Bubeck Wardenburg, J. Staphylococcus aureus pore-forming toxins: the 
interface of pathogen and host complexity. Semin. Cell Dev. Biol. 72, 101-116 (2017). 

3. Spaan, A. N., van Strijp, J. A. G. & Torres, V. J. Leukocidins: staphylococcal bi-component 
pore-forming toxins find their receptors. Nat. Rev. Microbiol. 15, 435-447 (2017). 

4. Maurer, K. etal. Autophagy mediates tolerance to Staphylococcus aureus alpha-toxin. 
Cell Host Microbe 17, 429-440 (2015). 

5. Inoshima, |. et al. A Staphylococcus aureus pore-forming toxin subverts the activity of 
ADAM10O to cause lethal infection in mice. Nat. Med. 17, 1310-1314 (2011). 

6. Becker, R. E., Berube, B. J., Sampedro, G. R., DeDent, A. C. & Bubeck Wardenburg, J. 
Tissue-specific patterning of host innate immune responses by Staphylococcus aureus 
a-toxin. J. Innate | mmun. 6, 619-631 (2014). 

7. Matsuzawa-lshimoto, Y., Hwang, S. & Cadwell, K. Autophagy and inflammation. Annu. 
Rev. Immunol. 36, 73-101 (2018). 


264 | Nature | Vol579 | 12 March 2020 


8. Presley, J. F., Mayor, S., McGraw, T. E., Dunn, K. W. & Maxfield, F. R. Bafilomycin A1 
treatment retards transferrin receptor recycling more than bulk membrane recycling. 
272, 13929-13936 (1997). 

9. Johnson, L. S., Dunn, K. W., Pytowski, B. & McGraw, T. E. Endosome acidification and 
receptor trafficking: bafilomycin A1 slows receptor externalization by a mechanism 
involving the receptor's internalization motif. 4, 1251-1266 (1993). 

10. Cadwell, K. & Debnath, J. Beyond self-eating: the control of nonautophagic functions and 
signaling pathways by autophagy-related proteins. J. Cell Biol. 217, 813-822 (2018). 

11. Stoeck, A. et al. A role for exosomes in the constitutive and stimulus-induced ectodomain 
cleavage of L1 and CD44. Biochem. J. 393, 609-618 (2006). 

12. Shimoda, M. et al. Loss of the Timp gene family is sufficient for the acquisition of the 
CAF-like cell state. Nat. Cell Biol. 16, 889-901 (2014). 

13. Ejlerskov, P. et al. Tubulin polymerization-promoting protein (TPPP/p25a) promotes 
unconventional secretion of a-synuclein through exophagy by impairing 
autophagosome-lysosome fusion. J. Biol. Chem. 288, 17313-17335 (2013). 

14. Guo, H. et al. Atg5 disassociates the V1VO-ATPase to promote exosome production and 
tumor metastasis independent of canonical macroautophagy. Dev. Cell 43, 716-730 
(2017). 

15. Miao, Y., Li, G., Zhang, X., Xu, H. & Abraham, S. N. A TRP channel senses lysosome 
neutralization by pathogens to trigger their expulsion. Cell 161, 1306-1319 (2015). 

16. Itakura, E., Kishi-ltakura, C. & Mizushima, N. The hairpin-type tail-anchored SNARE 
syntaxin 17 targets to autophagosomes for fusion with endosomes/lysosomes. Cell 151, 
1256-1269 (2012). 

17. Kimura, T. et al. Dedicated SNAREs and specialized TRIM cargo receptors mediate 
secretory autophagy. EMBO J. 36, 42-60 (2017). 

18. Trajkovic, K. et al. Ceramide triggers budding of exosome vesicles into multivesicular 
endosomes. Science 319, 1244-1247 (2008). 

19. Cho, Y. E. et al. Increased liver-specific proteins in circulating extracellular vesicles as 
potential biomarkers for drug- and alcohol-induced liver injury. PLoS One 12, e0172463 
(2017). 

20. Alonzo, F., Ill et al. CCR5 is a receptor for Staphylococcus aureus leukotoxin ED. Nature 
493, 51-55 (2013). 

21. Mitamura, T., Higashiyama, S., Taniguchi, N., Klagsbrun, M. & Mekada, E. Diphtheria toxin 
binds to the epidermal growth factor (EGF)-like domain of human heparin-binding 
EGF-like growth factor/diphtheria toxin receptor and inhibits specifically its mitogenic 
activity. J. Biol. Chem. 270, 1015-1019 (1995). 

22. Montecalvo, A. et al. Mechanism of transfer of functional microRNAs between mouse 
dendritic cells via exosomes. Blood 119, 756-766 (2012). 

23. Kalluri, R. The biology and function of exosomes in cancer. J. Clin. Invest. 126, 1208-1215 
(2016). 

24. Henry, B. D. et al. Engineered liposomes sequester bacterial exotoxins and protect from 
severe invasive infections in mice. Nat. Biotechnol. 33, 81-88 (2015). 

25. Tan,J.M.J. etal. An ATG16L1-dependent pathway promotes plasma membrane repair and 
limits Listeria monocytogenes cell-to-cell spread. Nat. Microbiol. 3, 1472-1485 (2018). 

26. Matsuzawa-lshimoto, Y. et al. Autophagy protein ATG16L1 prevents necroptosis in the 
intestinal epithelium. J. Exp. Med. 214, 3687-3705 (2017). 

27. Murrow, L., Malhotra, R. & Debnath, J. ATG12-ATG3 interacts with Alix to promote basal 
autophagic flux and late endosome function. Nat. Cell Biol. 17, 300-310 (2015). 

28. Guo, H. et al. Atg5 disassociates the V,V.-ATPase to promote exosome production and 
tumor metastasis independent of canonical macroautophagy. Dev. Cell 43, 716-730 
(2017). 


Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© The Author(s), under exclusive licence to Springer Nature Limited 2020 


Methods 


Mice 

Age- and gender-matched 8-12-week-old mice on the C57BL/6J back- 
ground were used. Atg/6L1'" mice on the C57BL/6J background were 
previously described*”*”°. ‘Wild-type’ refers to littermate controls 
generated from breeder pairs that were heterozygous for the Atg16L1'™ 
allele for experiments that involve comparisons between genotype. For 
other experiments, C57BL/6J mice were purchased from the Jackson 
Laboratory and bred onsite. All animal studies were performed accord- 
ing to approved protocols and ethical guidelines established by the 
NYU School of Medicine Institutional Animal Careand Use Committee 
(IACUC) and Institutional Review Board. 


Cell lines 

For in vitro studies, the human lung epithelial cell line, A549, was pur- 
chased from ATCC (catalogue number CCL-185). Human embryonal 
kidney cells, HEK293FT, purchased from ThermoFisher (catalogue 
number R70007), were used for lentiviral packaging. All cell lines were 
confirmed as free from mycoplasma contamination. 


Bacterial growth 

S. aureus strain LAC/USA300 was grown overnight in tryptic soy broth 
(TSB) with shaking at 37 °C and diluted 1/100 followed by an additional 
3-4h of growth until bacteria reached an optical density of 2. S. pneu- 
moniae strain D39 was grown overnight in Luria-Bertani (LB) broth at 
37 °C. The following day S. pneumoniae was diluted 1/50 followed by 
4hof growth until the optical density reached 0.5. C. rodentium and 
S. Typhimurium were grown overnight in LB broth at 37 °C. The fol- 
lowing day, both were diluted 1/10 followed by 4 h of growth until the 
optical density reached 2.0. Bacterial density was confirmed by dilu- 
tion plating. 1 x 10° CFU of each bacterial strain were boiled at 95 °C for 
2 hand resuspended in PBS for experiments with heat-killed bacteria. 


shRNA knockdown 

Lentivirus-based knockdown of human A7TGI6L1 (5’-CCGGACTG 
TAGCTTTGCCGTGAATGCTCGAGCAT TCACGGCAAAGCTACAGTT TTT 
TTG-3’), ULK1 (5’-CCGGGCCCTTTGCGTTATAT TGTATCTCGAGATA 
CAATATAACGCAAAGGGCTTTTT-3’), ATGS (5’-CCGGGAT TCATGGA 
ATTGAGCCAATCTCGAGAT TGGCTCAATTCCATGAATCTTTTTIG-3’), 
ATG7 (5’-CCGGGCTTTGGGATT TGACACAT T TCTCGAGAAATGTGTC 
AAATCCCAAAGCTTTTT-3’), ADAMI10 (5’-CCGGCCAGGTGGAATTACTTA 
ATTCTCGAAGAATTTAAGTAATTCCTGGTTTTT-3’) and nontargeting 
control were performed using MISSION shRNA constructs (Sigma- 
Aldrich) as described”. Viruses expressing shRNAs were produced by 
DNA transfection via Lipofectamine 3000 (ThermoFisher). Successful 
knockdown was confirmed by western blot and/or reverse transcription 
(RT) with quantitative polymerase chain reaction (qPCR). 


Flowcytometry 

A549 cells were stained for surface markers ADAM10 (human SHM14) 
and EpCAM (human 9C4) using antibodies from BioLegend. A fixable 
live/dead stain from BioLegend was used to exclude dead cells. For 
profiling, exosome pellets were resuspended in 100 pl of PBS and were 
stained with a combination of CD9 (human HI9a, mouse MZ3), CD63 
(human H5C6, mouse NVG-2), CD81 (human 5a6, mouse Eat-2), CCR5 
(mouse HM-CCRS) and/or ADAM10 surface antibodies from BioLegend 
for 60 min at 4 °C. Exosome fractions were then stained with PKH67 
(Sigma) as recommended by the manufacturer. Exosome fractions 
were washed in 40 ml PBS and ultracentrifuged again at 100,000g for 
60 min. Washed exosome pellets were resuspended PBS. For LysoSen- 
sor experiments, A549 cells were stimulated for 4h (11M CpG-A, 10 nM 
bafilomycin, or medium alone). Cells were loaded with 1 1M Lysosensor 
Green DND-189 (Molecular Probes, Invitrogen) diluted in prewarmed 
medium and incubated for 15 min at 37 °C. Cells were collected with 


1ml5mM PBS-EDTA and washed once with FACS buffer (5% FCS, 1x 
PBS, 2 mM EDTA). Cells and exosomes were analysed using Beckman 
Coulter Cytoflex Cytometer. For FACS-assisted purification, exosome 
identification and isolation were performed as described”, and stained 
with acombination of CD81, CD63 and PHK67. Exosomes were sorted 
using the FACS ARIA Iu SORP cell sorter. 


Western blotting 

We collected 1x 10° cells, washed them with PBS, and suspended them 
in RIPA buffer (Thermo-Scientific) containing 10x protease inhibitor 
(Santa Cruz Biotechnologies). Tissue homogenate was then pelleted 
twice at 10,000g for 10 min at 4 °C. Protein concentration in the super- 
natant was measured by Bradford assay and reduced using 4x Laemmli 
buffer containing B-mercaptoethanolat 95 °C for 5 min. For gel electro- 
phoresis, 10-30 pg of protein was run at 120 Vfor 1h using a4-12% gra- 
dient protein gel (Thermo Fisher). Proteins were then transferredtoan 
Immuno-Blot polyvinylidene fluoride (PVDF) membrane through Bio- 
Rad semi-dry transfer apparatus for 1h, at12 V constant. Membrane was 
incubated for 30 min with 5% non-fat dairy milk, and mouse anti-B-actin 
(Abcam) at 1/10,000, polyclonal rabbit anti-ADAM1O0 (Cell Signaling) 
at 1/2,500, mouse anti-ATG16L1 (MBL) at 1/1,000, monoclonal rabbit 
anti-CD9 (Cell Signaling) at 1/1,000, monoclonal mouse anti-CD81 
(Cell Signaling) at 1/1,000, polyclonal rabbit anti-ARF6 (Cell Signaling) 
at 1/2,000, polyclonal rabbit anti-ASS1 (Abcam) at 1/1,000, polyclonal 
rabbit anti- HLA (Sigma) at 1/5,000, polyclonal rabbit anti-SQSTM1 
(Cell Signaling) at 1/2,000, monoclonal rabbit anti-LC3 (Cell Signaling) 
at 1/2,000, or polyclonal rabbit anti-STX17 (Abcam) at 1/1,000 were 
probed overnight at 4 °C. Membranes were washed three times for 5 min 
and probed with secondary antibody rabbit-anti mouse LICOR IRDye 
800CW and goat anti-rabbit LICOR IRDye 800CW 680 antibodies for 
1hat room temperature. After additional washing, protein was then 
detected with a LICOR Odyssey CLX imaging system. 


Exosome isolation 

Forty-eight hours before isolation, 1 x 10’ A549 cells were plated in 
150-mm tissue-culture dishes. At 24 h before isolation, approximately 
35-40 ml of 10% Dulbecco’s modified Eagle medium (DMEM: 10% fetal 
bovine serum (FBS), 1% nonessential amino acids and 1% penicillin/ 
streptomycin) was removed, and fresh 10% DMEM without supplements 
was added to each dish. On the day of exosome isolation, medium from 
each plate was removed and centrifuged once at 500g for 10 min, then 
centrifuged once at 10,000g for 10 min. Supernatants were passed 
through a 0.22-um filter and finally ultracentrifuged at 100,000g for 
90 min. Following ultracentrifugation, supernatant was discarded. The 
pellet remaining after ultracentrifugation was collected and used for 
downstream analysis. When assessing by western blot, normalization 
was performed by controlling for the number of cells seeded onto the 
plate, and input loading was confirmed by probing CD9 inthe depleted 
cell fractions. For exosome isolation from broncholear lavage (BAL), 
mice were killed and the trachea exposed. Usinga 0.5-inch blunt-nose 
needle, 1 ml of PBS was flushed into the lungs and removed three times. 
The exosome-isolation protocol was then performed on the remaining 
BAL fraction as described above. 


a-Toxin and exosome treatment of cultured cells 

To determine cell sensitivity to a-toxin, we seeded 3 x 10* A549 cells 
in 96-well plates and allowed them to attach overnight. Various con- 
centrations of a-toxin were then added and incubated together for 
3 h at 37 °C. We collected 50 ml of supernatant and measured cell 
death, either by LDH release (which indicates pore formation; Pro- 
mega CytoTox-One Kit) or by metabolic activity via CellTiter (Promega 
catalogue number G3582). Total cytolysis was calculated according to 
the manufacturer's instructions. For experiments in which the protec- 
tive ability of exosomes was analysed, 3 x 10* A549 cells were seeded 
in 96-well plates and allowed to attach overnight. Exosome fractions 
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were isolated from 35 ml of A549 culture supernatant as described 
above. The exosome fraction was mixed with 1 pg mI a-toxin in PBS. 
The exosome/a-toxin mixture was incubated at 37 °C for 30 min and 
then added to plated A549 cells. 


Heat-killed bacteria, bacterial components and inhibitors 

We seeded 5 x 10° A549 cells and allowed them to attach overnight. The 
following day, cells were washed with PBS and new medium was added, 
including 5 x 10° CFU of heat-killed S. aureus, 2 x 10° CFU of heat-killed 
S. pneumoniae, 5 x 10° CFU of heat-killed C. rodentium, 5 x 10° CFU of 
heat-killed S. Typhimurium, 250 pg mI“ lipoteichoic acid (LTA, Sigma, 
cataloguenumber L2515), 11g ml“ lipopolysaccharide (LPS, InvivoGen, 
catalogue number tlrl-3pelps), 10 pg mI“ peptidoglycan (PDG, Sigma, 
catalogue number 77140), 2 uM CpG DNA, 0.5 pg ml Pam2CSK (Invivo- 
Gen, catalogue number tlrl-pm2s-1), 0.5 pg ml Pam3CSK (InvivoGen, 
catalogue number tIrl-pms), 0.5 pg mI S. aureus (SA) genomic DNA 
(gDNA, 0.5 pg mI), 0.5 pg mI“ S. aureus RNA, 2 1M GW4869 (Sigma, 
catalogue number D1692) or 200 nM Torin-1 (Tocris, catalogue number 
4247). After 4h or 18 h, supernatants were removed from cultures and 
exosomes were collected as described above. Exosomes were identified 
and quantified using the aforementioned flow-cytometry protocol. 


Infection and exosome treatment of mice 

Donor mice received an intranasal treatment of heat-killed S. aureus to 
induce exosome production. After 4-6 h, mice were bled submandibu- 
larly and plasma was collected. The exosome fraction was collected as 
described above for A549 cells. Recipient wild-type or Atg16L1™ mice 
each received exosomes intraperitonially isolated from 1 ml of plasma 
on day 1, day O, and day +1 of infection ina final volume of 1 ml of PBS. 
Mice were intravenously infected with USA300 S. aureus on day O, and 
were monitored daily for signs of morbidity. 


a-Toxin purification from S. aureus 

Primers VJT1391 (S’-GGGGG-AAGCTT-gtttgatatggaactcctgaatttttcg-3’; 
the underlined sequence is the HindIII site) and VJT1395 (5’ GATAA-GC 
TAGC-tta-GTGGTGGTGGTGGTGGTG-atttgtcatttcttc-3’; the underlined 
sequence is the Nhel site) were used to amplify the promoter region of 
hla followed by the hla gene and polyhistidine tag (6 x Histag) from the 
genomic DNA of S. aureus strain Newman by PCR. The PCR product was 
then cloned into the pOS1 plasmid using the HindIII and Nhel restric- 
tion sites to generate the pOS1-phla-hla-6his plasmid. The purified 
plasmid was transformed into Escherichia coliDH5a competent cells, 
selected by ampicillin resistance (100 pg mI) and confirmed by colony 
PCR and Sanger sequencing (Genewiz). The plasmid from a positive 
clone was purified and electroporated into S. aureus RN4220, selected 
by resistance to chloramphenicol (10 pg mI”); the plasmid purified 
from RN4220 was then electroporated into S. aureus Newman AlukED 
AhIgACB::tet AlukAB.:spec Ahla:sermC (AMAA) and selected for by resist- 
ance to chloramphenicol (10 pg ml) resistance. For purification of 
His-tagged a-toxin, the S. aureus Newman AAAA strain harbouring the 
pOS1-phla-hla-6his plasmid (strain VJT 45.56) were grown overnight 
in5 ml TSB (Fisher) supplemented with chloramphenicol (10 pg ml*) 
at 37 °C, shaking at 180 rpm, then subcultured the following day at a 
1/100 dilutionin TSB supplemented with chloramphenicol (10 pg mI”) 
and incubated for 5h at 37 °C, shaking at 180 rpm. The cultures were 
centrifuged for 15 min at 6,000 rpm and 4 °C, and thesupernatants were 
filter-sterilized through a 0.22-um filter (Corning). The filtrates were 
incubated inthe presence ofa final concentration of 10 mMimidazole 
and nickel-nitrilotriacetic acid (Ni-NTA) agarose resin (Qiagen) equili- 
brated with 10 mM imidazole (Fisher) in 1x Tris-buffered saline (TBS; 
Cellgro) for 30 min at 4 °C while nutating. The filtrates were passed 
throughaglass column by gravity filtration, then Ni-NTA-bound toxins 
were washed with 25 mM imidazole, followed by a secondary wash 
with 1x TBS. The Ni-NTA-bound toxins were eluted using 500 mM imi- 
dazole. The eluted toxins were dialysed into 10% glycerol in 1x TBS 


and filtered through a 0.22-um filter before storage at -80 °C. When 
required, the toxins were concentrated using concentrator columns 
(Ultra-15 Centrifugal Filter Units 10,000 NMWL, 15-ml volume capac- 
ity; EMD Millipore Amicon) before measuring protein concentration 
using absorbance at 280 nm with a Nanodrop (Thermo Scientific) and 
Beer-Lambert’s equation. We separated 2 pg of the purified proteins 
by SDS-PAGE at 90 V for 120 min, followed by Coomassie blue staining 
to confirm protein purity by visualization. 


Sample preparation for mass spectrometry 

Exosomes were lysedin 8 M urea containing 10% SDS. Lysed exosomes 
were reduced using dithiothreitol (5 pl of 0.2 M concentration) for 
That 55°C. The reduced cysteines were subsequently alkylated with 
iodoacetamide (5 pl of 0.5 M) for 45 min in the dark at room temper- 
ature. Each sample was loaded onto S-trap microcolumns (Protifi) 
according tothe manufacturer’s instructions. In brief, 3 pl of 12% phos- 
phoric acid and 165 ul of binding buffer (90% methanol, 100 mM trieth- 
ylammonium bicarbonate (TEAB)) were added to each sample. Samples 
were loaded onto the S-trap columns and centrifuged at 4,000g for 
30 s. After three washes, 20 pl of 50 mM TEAB and 1 pg of trypsin (1/50 
ratio) were added to the trap and incubated at 47 °C for 1h. Peptides 
were then eluted using 40% acetonitrile (ACN) in 0.5% acetic acid 
followed by 80% ACN in 0.5% acetic acid. Eluted peptides were dried 
and concentrated in aSpeedVac. 


Liquid chromatography-tandem mass spectrometry analysis 

We loaded 1 pg of each sample onto a trap column (Acclaim PepMap 
100 pre-column, 75 um x 2.cm, C18, 3 pm, 100 A, Thermo Scientific) 
connected to an analytical column (EASY-Spray column, 50 pm x 75 ym 
ID, PepMap RSLC C18, 21m, 100A, Thermo Scientific) using the autosa- 
mpler of an Easy nLC1000 (Thermo Scientific) with solvent A consist- 
ing of 2% acetonitrile in 0.5% acetic acid and solvent B consisting of 
80% acetonitrile in 0.5% acetic acid. The peptide mixture was gradient 
eluted into the Orbitrap QExactive HF-X Mass Spectrometer (Thermo 
Scientific) using the following gradient: 5-35% solvent B for 120 min, 
35-45% solvent B for 10 min, and 45-100% solvent B for 20 min. The full 
scan was acquired witha resolution of 60,000 (at an m/zratio of 200), a 
target value of 3 x 10° anda maximum ion time of 45 ms. Following each 
full mass-spectrometry (MS) scan, 20 data-dependent MS/MS spectra 
were acquired. These MS/MS spectra were collected witha resolution 
of 15,000, an AGC target of 1 x 10°, amaximum ion time of 120 ms, one 
microscan, a2 m/zisolation window, a fixed first mass of 150 m/z, a 
dynamic exclusion of 30s, and a normalized collision energy of 27. 


Analysis of mass-spectrometry data 

All acquired MS/MS spectra were searched against the UniProt mouse 
reference database using Sequest HT within Proteome Discoverer 1.4 
(Thermo Fisher Scientific). The parameters for searching MS/MS data 
were set as follows: precursor mass tolerance +10 ppm, fragment mass 
tolerance + 0.02 Da, digestion enzyme trypsin allowing two missed 
cleavages, fixed modification of carbamidomethyl on cysteine, variable 
modification of oxidation on methionine, and variable modification of 
deamidation on glutamine and asparagine. The results were filtered using 
a1% peptide and protein false discovery rate searched against a decoy 
database and requiring proteins to have at least two unique peptides. 


a-Toxin oligomerization assay 

Exosomes were collected from A549 culture supernatants as described 
above. Exosome fractions were resuspended in 30 pl PBS. a-Toxin was 
added to exosome suspension at a concentration of 1 pg mI. The 
exosome/a-toxin combination was then shaken at room tempera- 
ture for 1h. Following incubation, the exosome/a-toxin mixture was 
resuspended in 40 ml PBS and spun at 100,000g for 90 min to pellet 
exosomes with bound a-toxin and remove excess a-toxin. The exosome 
fraction was resuspended in RIPA buffer containing 4x Laemmli buffer 


without B-mercaptoethanol. For gel electrophoresis, each sample 
(including an a-toxin-only lane) was run at 120 V for 1h using a4-20% 
tris-glycine gradient protein gel (Thermo Fisher). Proteins were then 
transferred to anImmuno-Blot PVDF membrane through Bio-Rad semi- 
dry transfer apparatus for 1h at aconstant voltage of 12 V. Membrane 
was incubated for 30 min with 5% non-fat dairy milk, and mouse anti- 
a-toxin (Sigma) at 1/5,000 was probed overnight at 4 °C. Membranes 
were washed three times for 5 min and probed with secondary antibody 
goat anti-rabbit LICOR IRDye 800CW 680 antibodies for 1h at room 
temperature. After additional washing, protein was then detected with 
a LICOR Odyssey CLX imaging system. 


Transmission electron microscopy 

For analysis ofexosome morphology, we placed 5 pl of isolated exosomes 
on glow-discharged carbon-coated 400-mesh copper/rhodium grids 
and stained the samples with 1% uranyl acetate aquous solution. For 
whole-mountimmune-electron microscopy, we deposited 5 pl of 2%-par- 
aformaldehyde-fixed exosomes on glow-discharged formvar-carbon- 
coated copper grids, and allowed the samples to adsorb for 20 min. 
After washing with PBS, the grids were incubated with 50 mM glycine/ 
PBS for 5 min, blocked with 1% coldwater fish skin gelatin (Sigma) for 
10 min, and incubated with primary antibodies (anti-TSG101, Abcam) in 
blocking solution for 2 hat room temperature. Following washing with 
PBS, gold-conjugated secondary antibodies (15 nm protein-A-gold, Cell 
Microscopy Center, University Medical Center Utrecht; 12-nm colloidal 
gold AffiniPure goat anti-rabbit IgG (H+L), Jackson ImmunoReasearch 
Laboratories) were applied in the blocking buffer for 1h. After washing 
with PBS, the grids were fixed in 1% glutaraldehyde in PBS for 5 min, 
washed with water, contrasted and embedded ina mixture of 3% uranyl 
acetate and 2% methylcellulose at a ratio of 1/9. All stained grids were 
examined under a Philips CM-12 electron microscope and photographed 
with a Gatan (4kx2.7k) digital camera (Gatan, Pleasanton, CA)”. 


Statistical analysis 

All analyses were performed with Graphpad Prism v.7. The numbers of 
animals or biological replicates used herein were estimated onthe basis 
of a power analysis with the following assumptions: the standard devia- 
tion will be roughly 20% of the mean; Pvalues willbe less than 0.05 when 
the null hypothesis is false; and the effect size (Cohen’s d) is between1.0 
and 2.0. The minimal number of mice required under these conditions 
ranges from 6 to 28 for in vivo experiments. We have also carefully chosen 
the indicated sample size on the basis of empirical evidence of what is 
necessary to interpret the data and statistical significance. A unpaired 
two-tailed t-test with Welch’s correction was used to evaluate differences 
between two groups. One-way ANOVA with Dunnet’s post-test analysis 
was performed to evaluate differences between groups of three or more. 
The log-rank Mantel-—Cox test was used for comparison of mortality 
curves. No randomization or blinding was used inthis study. 


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The datasets generated herein are available from the corresponding 
authors upon request. Western blot gel source data can be found in 
Supplementary Fig. 1. All identified proteins from mass-spectrometry 
experiments and thier accession identification codes are listed in Sup- 
plementary Tables 1-3. Source Data for Fig. 4 and Extended Data Fig. 6 
are available with the paper. 
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Extended Data Fig. 1| ADAM10 and EpCAM levels following lysosomal 
inhibition with ammonium chloride, chloroquine or bafilomycin or 
proteasomal inhibition with MG132.a, Time course of flow-cytometry 
analysis of ADAM10 following lysosomal inhibition with ammonium chloride 
(NH,Cl, 20 mM), chloroquine (CQ,50 uM) or PBS as acontrol; n=3. 

b-d, Western blot analysis of ADAM10 and SQSTM1 following lysosomal 
inhibition with NH,Cl, CQ or bafilomycin (BAF, 10nM).Shown area 
representative western blot from four independent experiments (b), 
quantification of ADAM10 levels (n=5) (c) and quantification of SQSTM1 levels 
(n=3) (d) at 24 hafter inhibition. e, f, Representative histogram (e) and 


quantification (f) of cell-surface E>DCAMin BAF-treated A549 cells; n=3.g, Time 
course of flow-cytometry analysis of EpCAM following treatment with NH,Cl or 
CQ;n=4.h,i, ADAM1O, P4D1 and actin levels following proteasomal inhibition 
with the chemical compound MG132. Shown area flow-cytometry time course 
of cell-surface ADAM10 levels following MG132 treatment (h) anda 
representative western blot from three independent experiments (i);n=3. 
Measurements were taken from distinct samples and graphs show 
meant+s.e.m.a,c,d, f-h, One-way ANOVA with Dunnet’s post-test compared 
withPBS treatment or time 0. 
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Extended Data Fig. 2 |Exosome-isolation and quantification strategies. 

a, Exosome-isolation protocol from in vitro or in vivo sources. Exosomes are 
isolated using a multistep centrifugation procedure including a0.22-um 
filtration step.b, Western blot of actin, ARF6 and CD9 following each 
sequential centrifugation step during exosome isolation. c, Electron- 
microscopy (EM) quantification of vesicles 80-150 nm and greater than150 nm 
insize;n=80images. d, EMnegative staining of exosome fractions. Arrows 
indicate exosomes and protein aggregates. e, Representative EM images of 
the exosome fraction, and zoomed insets with arrows indicating the single 


CD63-PacBlue 


w ‘of 10° wo? we 10° 

CDB1-PE-Cy7 ADAM10-APC 
membranes of exosomes. f, Gating strategy and representative flow- 
cytometry plots from nt shRNA and ATGI6L1KD samples of sixindependently 
repeated experiments. Exosomes were stained with antibodies against CD9, 
CD63, CD81and ADAM10. Exosomes were concurrently labelled with PKH67, a 
lipid-membrane-incorporating dye. FSC, forward scatter; SSC, sidescatter. 
Measurements were taken from distinct samples and graphs show 
means +s.e.m.c, Two-tailed, unpaired t-test with Welch’s correction compared 
withPBS controls. 
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CD9 (exo CD9) and exosome ADAMI1O0 (exo ADAM10) following addition of CQ taken from distinct samples, and graphs show means ¢+s.e.m. b-d, One-way 
or BAF. Shown are a representative western blot from six independent ANOVA with Dunnet’s post-test compared with PBS controls. 
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Extended Data Fig. 4 | Exosomes are produced in response to bacterial 
exposure.a, Flow-cytometry quantification of exosomes per 100,000 events 
inmouse BMDCsand BMDMs with or without exposure to HKSA (BMDCs plus 
PBS, n= 6; BMDCs plus HKSA, n= 6; BMDMs plus PBS, n=3; BMDMs plus HKSA, 
n=4).b-h, Quantification of total exosomes in A549 cell-culture supernatant 
by flowcytometry 18 hafter treatment with peptidoglycan (PDG;b), 
lipopolysaccharide (LPS; c), lipoteichoic acid (LTA; d), Pam3CSK (e), Pam2CSK 
(f) or S. aureus RNA (SA RNA; g);n=3.h, Quantification of totalexosomes in 
TLR9 KD A549 cell-culture supernatants following treatment with CpG DNA; 
n=3.i, Flow-cytometry quantification of A549-produced exosomes following 
exposure to HKSA or to astrain of S. aureus deficient in the production of 
a-toxin (HK dHLA).j, Flow-cytometry quantification of exosomes isolated from 
A549 cells treated with PBS (n =3), S. aureus genomic DNA (SAgDNA; 

0.5 pg ml7; n=5), and/or DNasel (n=2).k, Flow-cytometry quantification of 


v 


exosomes isolated from cells treated with BAF (n=5), Torin-1(n=6), or both 
(n=3).1, Representative western blot of SQSMT1, LC31/II and actin in cells 
treated with BAF, Torin-1or both 4 hafter treatment, from twoindependent 
experiments. m, Flow-cytometry quantification of exosomes from A549 cells 
treated with PBS (mock treatment; n=8), CpG DNA (n=8), or CpG DNAand 
GW4869 (n=7).n, 0, Plasma exosome quantification of ATG16L1 flow/flox; 
CD11c-Cre(n), and ATG16L1 flow/flox; LysM-Cre (0) following exposure to 
either CpG DNA or HKSA, respectively. p, Representative western blot of 
ADAMI1O, ASS1, CD9 and CD81 in exosome fractions submitted to mass 
spectrometry, from three independent experiments. Measurements were 
taken from distinct samples and graphs show means +s.e.m.a, b-h, Two-tailed, 
unpaired t-test with Welch’s correction compared with PBS controls. 

j,k, m-o, One-way ANOVA with Dunnet’s post-test compared with PBS, mock, 
CpG DNAorCre-/+ controls. 
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Extended Data Fig. 5 | BAF and CpG DNA decrease acidic organelles. 

a-c, Representative flow-cytometry histograms from three independent 
experiments of Lysosensor signal following treatment with BAF or CpG 

DNA (a), or BAF with or without CpG DNA (b).c, Quantification of Lysosensor 


MFI following treatment with PBS alone (no BAF or CpG DNA; n=8), BAF (n=6), 
CpG DNA (n= 9), or BAF plus CpG DNA (n=6) (c). Measurements were taken 
from distinct samples and graphs show means +s.e.m.c, One-way ANOVA with 
Dunnet’s post-test compared with PBS controls. 
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Extended Data Fig. 6| Exosomes protect from S. aureus toxicity in vitro and 
in vivo. a, Flow-cytometry exosome quantification from nt shRNA control and 
ADAMI10 KDAS549 cells; n=3. b, Cell death, measured by LDH release, of A549 
cells treated with a-toxin only, pretreated with HKSA or CpG DNA and o-toxin 
(‘induced’), or pre-exposed to HKSA or CpG DNA followed by PBS wash and then 
q-toxin treatment (‘induced; washed’);n=5. c, Representative flow-cytometry 
histograms of CCR5 on CD81-positive, CD63-positive and CD9-positive 
exosomes isolated from mouse BMDMs. d, Exogenous exosome-transfer 
protocol. In step 1, donor miceare pre-exposed to HKSAi.v. to induce exosome 
production. Instep 2, exosomes from donor mice are injected intraperitoneally 
(i.p.) on day -1, day O and day +1 following lethali.v. injection of S. aureus. 

e, Survival of wild-type mice infected i.v. with a lethal dose of 5 x 10’ CFU of 

S. aureus (USA300) that were mock-treated (n=10) or injected i.p. with 
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exosomes from Atg16l1™ mice (n=8). f, Endogenous exosome-protection 
protocol. Mice arei.v. injected with HKSA to induce exosome production. Four 
hours later, mice are infected witha lethal dose of S. aureus (2.5-5 x10”). 

g,h, Western blot analysis of a-toxin oligomerization in total BAL or in exosome 
fractionin BAL of mice pre-exposed to HKSA or PBS intranasally (i.n.), 
representative of four independent experiments. i,j, Quantification of a-toxin 
monomer (i) and heptamer (j) in BAL and exosome fraction following 
pre-exposure;n=4.k, Ratio of a-toxin heptamer in exosome fraction to total 
a-toxin signal in BAL; n=4. Measurements were taken from distinct samples 
and graphs show means +s.e.m.a,i-k, Two-tailed, unpaired t-test with Welch’s 
correction.b, One-way ANOVA with Dunnet’s post-test compared with a-toxin 
only or ‘induced’ controls. e, log-rank Mantel-Cox test. 
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Since the outbreak of severe acute respiratory syndrome (SARS) 18 years ago, alarge 
number of SARS-related coronaviruses (SARSr-CoVs) have been discovered in their 
natural reservoir host, bats’ *. Previous studies have shown that some bat SARSr-CoVs 
have the potential to infect humans*”. Here we report the identification and 
characterization of anew coronavirus (2019-nCoV), which caused an epidemic of 
acute respiratory syndrome in humans in Wuhan, China. The epidemic, which started 
on12 December 2019, had caused 2,794 laboratory-confirmed infections including 80 
deaths by 26 January 2020. Full-length genome sequences were obtained from five 
patients at an early stage of the outbreak. The sequences are almost identical and 
share 79.6% sequence identity to SARS-CoV. Furthermore, we show that 2019-nCoVis 
96% identical at the whole-genome level to a bat coronavirus. Pairwise protein 
sequence analysis of seven conserved non-structural proteins domains show that this 
virus belongs to the species of SARSr-CoV. In addition, 2019-nCoV virus isolated from 
the bronchoalveolar lavage fluid of a critically ill patient could be neutralized by sera 
from several patients. Notably, we confirmed that 2019-nCoV uses the same cell entry 
receptor—angiotensin converting enzyme II (ACE2)—as SARS-CoV. 


Coronaviruses have caused two large-scale pandemics in the past 
two decades, SARS and Middle East respiratory syndrome (MERS)?®”. 
Ithas generally been thought that SARSr-CoV—which is mainly found 
in bats—could cause a future disease outbreak’. Here we report 
on a series of cases caused by an unidentified pneumonia disease 
outbreak in Wuhan, Hubei province, central China. This disease out- 
break—which started from a local seafood market—has grown sub- 
stantially to infect 2,761 people in China, is associated with 80 deaths 
and has led to the infection of 33 people in 10 additional countries 
as of 26 January 2020”. Typical clinical symptoms of these patients 
are fever, dry cough, breathing difficulties (dyspnoea), headache 
and pneumonia. Disease onset may result in progressive respiratory 
failure owing to alveolar damage (as observed by transverse chest 
computerized-tomography images) and even death. The disease was 
determined to be caused by virus-induced pneumonia by clinicians 
according toclinical symptomsand other criteria, including arise in 
body temperature, decreases in the number of lymphocytes and white 
blood cells (although levels of the latter were sometimes normal), new 
pulmonary infiltrates on chest radiography and no obvious improve- 
ment after treatment with antibiotics for three days. It appears that 
most of the early cases had contact history with the original seafood 
market; however, the disease has now progressed to be transmitted 
by human-to-human contact. 


Samples from seven patients with severe pneumonia (six of whomare 
sellers or deliverymen from the seafood market), who were admitted to 
the intensive care unit of Wuhan Jin Yin-Tan Hospital at the beginning 
of the outbreak, were sent to the laboratory at the Wuhan Institute of 
Virology (WIV) forthe diagnosis of the causative pathogen (Extended 
Data Table1). Asa laboratory investigating CoV, we first used pan-CoV 
PCR primers to test these samples”, given that the outbreak occurred in 
winter andinamarket—the same environment as SARS infections. We 
found five samples to be PCR- positive for CoVs. One sample (WIVO4), 
collected from the bronchoalveolar lavage fluid (BALF), was analysed by 
metagenomics analysis using next-generation sequencing to identify 
potential aetiological agents. Of the 10,038,758 total reads—of which 
1,582 total reads were retained after filtering of reads fromthe human 
genome-—1,378 (87.1%) sequences matched the sequence of SARSr- 
CoV (Fig. 1a). By de novo assembly and targeted PCR, we obtained a 
29,891-base-pair CoV genome that shared 79.6% sequence identity 
to SARS-CoV BJO1 (GenBank accession number AY278488.2). High 
genome coverage was obtained by remapping the total reads to this 
genome (Extended Data Fig. 1). This sequence has been submitted to 
GISAID (https://www.gisaid.org/) (accession number EPLISL_402124). 
Following the name given by the World Health Organization (WHO), 
we tentatively call it novel coronavirus 2019 (2019-nCoV). Four more 
full-length genome sequences of 2019-nCoV (WIVO2, WIVOS, WIVO6 and 
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Fig. 1| Genome characterization of 2019-nCoV. a, Metagenomics analysis of 
next-generation sequencing of BALF from patient ICU06. b, Genomic 
organization of 2019-nCoV WIVO04.M, membrane.c, Similarity plot basedon 
the full-length genome sequence of 2019-nCoV WIVO04. Full-length genome 
sequences of SARS-CoV BJ01, bat SARSr-CoV WIVI1, bat coronavirus RaTG13 and 
ZC45 were used as reference sequences. d, Phylogenetic tree based on 


WIV07) (GISAID accession numbers EPI ISL_402127-402130) that were 
more than 99.9% identical to each other were subsequently obtained 
from four additional patients using next-generation sequencing and 
PCR (Extended Data Table 2). 

The virus genome consists of six major open-reading frames (ORFs) 
that are common to coronaviruses and a number of other accessory 
genes (Fig. 1b). Further analysis indicates that some of the 2019-nCoV 
genes shared less than 80% nucleotide sequence identity to SARS-CoV. 
However, the amino acid sequences of the seven conserved replicase 
domains in ORF lab that were used for CoV species classification were 
94.4% identical between 2019-nCoV and SARS-CoV, suggesting that 
the two viruses belong to the same species, SARSr-CoV. 

Wethen found that ashort region of RNA-dependent RNA polymerase 
(RdRp) from a bat coronavirus (BatCoV RaTG13)—which was previously 
detected in Rhinolophus affinis from Yunnan province—showed high 
sequence identity to 2019-nCovV. We carried out full-length sequencing 
onthis RNA sample (GISAID accession number EPI_ISL_402131). Simplot 
analysis showed that 2019-nCoV was highly similar throughout the 
genome to RaTGI13 (Fig. 1c), with an overall genome sequence identity 
of 96.2%. Using the aligned genome sequences of 2019-nCoV, RaTG13, 
SARS-CoV and previously reported bat SARSr-CoVs, no evidence for 
recombination events was detected in the genome of 2019-nCoV. Phy- 
logenetic analysis of the full-length genome and the gene sequences of 
RdRp and spike (S) showed that—for all sequences—RaTGI13 is the clos- 
est relative of 2019-nCoV and they forma distinct lineage from other 
SARSr-CoVs (Fig. 1d and Extended Data Fig. 2). The receptor-binding 
spike protein encoded by the S gene was highly divergent from other 
CoVs (Extended Data Fig. 2), with less than 75% nucleotide sequence 


0.4 


nucleotide sequences of complete genomes of coronaviruses. MHV, murine 
hepatitis virus; PEDV, porcine epidemic diarrhoea virus; TGEV, porcine 
transmissible gastroenteritis virus.The scale bars represent 0.1 substitutions 
per nucleotide position. Descriptions of the settings and software that was 
used are included inthe Methods. 


identity to all previously described SARSr-CoVs, except for a 93.1% 
nucleotide identity to RaTG13 (Extended Data Table 3). The S genes of 
2019-nCoV and RaTG13 are longer than other SARSr-CoVs. The major 
differences in the sequence of the S gene of 2019-nCoV are the three 
short insertions inthe N-terminal domain as well as changesin four out 
of five of the key residues in the receptor-binding motif compared with 
the sequence of SARS-CoV (Extended Data Fig. 3). Whether the inser- 
tions in the N-terminal domain of the S protein of 2019-nCoV confer 
sialic-acid-binding activity as it does in MERS-CoV needs to be further 
studied. The close phylogenetic relationship to RaTG13 provides evi- 
dence that 2019-nCoV may have originated in bats. 

We rapidly developed a qPCR-based detection method on the basis 
of the sequence of the receptor-binding domain of the S gene, which 
was the most variable region of the genome (Fig. 1c). Our data show 
that the primers could differentiate 2019-nCoV from all other human 
coronaviruses including bat SARSr-CoV WIV1, which shares 95% identity 
with SARS-CoV (Extended Data Fig. 4a, b). Ofthe samples obtained from 
the seven patients, we found that six BALF and five oral swab samples 
were positive for 2019-nCoV during the first sampling, as assessed 
by qPCR and conventional PCR. However, we could no longer detect 
virus-positive samples in oral swabs, anal swabs and blood samples 
taken from these patients during the second sampling (Fig. 2a). How- 
ever, we recommend that other qPCR targets, including the RdRp or 
envelope (E) genes are used for the routine detection of 2019-nCovV. 
On the basis of these findings, we propose that the disease could be 
transmitted by airborne transmission, although we cannot rule out 
other possible routes of transmission, as further investigation, includ- 
ing more patients, is required. 
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Fig. 2 |Molecular and serological investigation of patient samples. 

a, Molecular detection of 2019-nCoV in seven patients. Patient information can 
be found in Extended Data Tables 1, 2. Detection methods are described in 

the Methods. AS, anal swab; OS, oral swab. b, Dynamics of 2019-nCoV antibody 
levels in one patient who showed signs of disease on 23 December 2019 (ICU- 
06). OD ratio, optical density at 450-630 nm. The right and left yaxes indicate 


For serological detection of 2019-nCoV, we used a previously devel- 
oped nucleocapsid (N) protein from bat SARSr-CoV Rp3 as antigen 
for IgG and IgM enzyme-linked immunosorbent assays (ELISAs), as 
this protein shared 92% amino acididentity to N protein of 2019-nCoV 
(Extended Data Fig. 5) and showed no cross-reactivity against other 
human coronaviruses except SARSr-CoV’. We were only able to obtain 
five serum samples from the seven patients with viral infections. We 
monitored viral antibody levels in one patient (ICU-06) 7, 8, 9 and 
18 days after the onset of disease (Extended Data Table 2). Acleartrend 
was observed in the IgG and IgM titres, which increased over time, 
except that the IgM titre was decreased in the last sample (Fig. 2b). 
As asecond analysis, we tested samples from 5 of the 7 virus-positive 
patients around 20 days after disease onset for the presence of viral 
antibodies (Extended Data Tables 1, 2). All patient samples—but not 
samples from healthy individuals—were strongly positive for viral lgG 
(Fig. 2b). There were also three IgM-positive samples, indicating an 
acute infection. 

We next successfully isolated the virus (called 2019-nCoV BetaCoV/ 
Wuhan/WIV04/2019) from both Vero E6 and Huh7 cells using the BALF 
sample of patient ICU-06. Clear cytopathogenic effects were observed 
incells after incubation for three days (Extended Data Fig. 6a, b). The 
identity of the strain WIV04 was verified in Vero E6 cells by immuno- 
fluorescence microscopy using the cross-reactive viral N antibody 
(Extended Data Fig. 6c, d) and by metagenomics sequencing, most of 
the reads of which mapped to 2019-nCoV, and qPCR analysis showed 
that the viral load increased from day 1to day 3 (Extended Data Fig. 6e, f). 
Viral particles in ultrathin sections of infected cells displayed a typi- 
cal coronavirus morphology, as visualized by electron microscopy 
(Extended Data Fig. 6g). To further confirm the neutralization activity 
of the viral lgG-positive samples, we conducted serum-neutralization 
assays in Vero E6 cells using the five patient sera that were IgG-positive. 
We demonstrate that all samples were able to neutralize 100 TCID,, 
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Patient Healthy Patient Healthy 


ELISA OD ratios for IgM and IgG, respectively. c, Serological test of 2019-nCoV 
antibodies in five patients (Extended Data Table 2). The asterisk indicates data 
collected from patient ICU-06 on 10 January 2020. b,c, The cut-off was to 0.2 
for the IgM analysis and to 0.3 for the IgG analysis, according to the levels of 
healthy controls. 


(50% tissue-culture-infective dose) of 2019-nCoV at a dilution of 
1:40-1:80. We also show that this virus could be cross-neutralized by 
horse anti-SARS-CoV serum (gift from L.-F. Wang) at dilutions of 1:40; 
however, the potential for cross-reactivity with SARS-CoV antibod- 
ies needs to be confirmed with anti-SARS-CoV serum from humans 
(Extended Data Table 4). 

ACE2 is known to be a cell receptor for SARS-CoV". To determine 
whether 2019-nCoV also uses ACE2 as acellular entry receptor, we 
conducted virus infectivity studies using HeLa cells that expressed or 
did not express ACE2 proteins from humans, Chinese horseshoe bats, 
civets, pigs and mice. We show that 2019-nCoV is able to use all ACE2 
proteins, except for mouse ACE2, as an entry receptor to enter ACE2- 
expressing cells, but not cells that did not express ACE2, indicating that 
ACE2is probably the cell receptor through which 2019-nCoVenters cells 
(Fig. 3). We also show that 2019-nCoV does not use other coronavirus 
receptors, such as aminopeptidase N (APN) and dipeptidyl peptidase 
4 (DPP4) (Extended Data Fig. 7). 

Thestudy provides a detailed report on 2019-nCoV, the likely aetio- 
logical agent responsible for the ongoing epidemic of acute respiratory 
syndrome in China and other countries. Virus-specific nucleotide- 
positive and viral-protein seroconversion was observed in all patients 
tested and provides evidence of an association between the disease and 
the presence of this virus. However, there are still many urgent ques- 
tions that remain to be answered. The association between 2019-nCoV 
and the disease has not been verified by animal experiments to fulfil 
the Koch’s postulates to establish a causative relationship between a 
microorganism and a disease. We do not yet know the transmission 
routine of this virus among hosts. It appears that the virus is becom- 
ing more transmissible between humans. We should closely monitor 
whether the virus continues to evolve to become more virulent. Owing 
toa shortage of specific treatments and considering the relatedness of 
2019-nCoV to SARS-CoV, some drugs and pre-clinical vaccines against 
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Fig. 3 | Analysis of the receptor use of 2019-nCoV. Determination of virus 
infectivity in HeLa cells that expressed or did not express (untransfected) 
ACE2. The expression of ACE2 plasmid with S tag was detected using mouse 
anti-S tag monoclonal antibody. hACE2, human ACE2; bACE2, ACE2 of 
Rhinolophus sinicus (bat); cACE2, civet ACE2; sSACE2, swine ACE2 (pig); mACE2, 
mouse ACE2. Green, ACE2; red, viral protein (N); blue, DAPI (nuclei). Scale bars, 
10 um. 


SARS-CoV could probably be used to treat this virus. Finally, consider- 
ing the wide spread of SARSr-CoV in their natural reservoirs, future 
research should be focused on active surveillance of these viruses 
for broader geographical regions. In the long term, broad-spectrum 
antiviral drugs and vaccines should be prepared for emerging infec- 
tious diseases that are caused by this cluster of viruses in the future. 
Most importantly, strict regulations against the domestication and 
consumption of wildlife should be implemented. 


Note added in proof: Since this paper was accepted, the ICTV has desig- 
nated the virus as SARS-CoV-2"; in addition, the WHO has released the 
official name of the disease caused by this virus, which is COVID-19". 
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Methods 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Sample collection 

Human samples, including oral swabs, anal swabs, blood and BALF 
samples were collected by Jinyintan hospital (Wuhan, China) with 
the consent of all patients and approved by the ethics committee of 
the designated hospital for emerging infectious diseases. Patients 
were sampled without gender or age preference unless indicated. 
For swabs, 1.5 ml DMEM containing 2% FBS was added to each tube. 
The supernatant was collected after centrifugation at 2,500 rpm, 
vortexing for 60 s anda standing period of 15-30 min. The superna- 
tant from swabs or BALF (no pre-treatment) was added to either lysis 
buffer for RNA extraction or to viral transport medium for isolation 
of the virus. The viral transport medium was composed of Hank’s 
balanced salt solution (pH 7.4) containing BSA (1%), amphotericin 
(15 pg mI”), penicillin G (100 units mI”) and streptomycin (50 pg mI’). 
Serum was separated by centrifugation at 3,000g for 15 min within 
24h of collection, followed by inactivation at 56 °C for 1h, and was 
then stored at 4 °C until use. 


Virus isolation, cell infection, electron microscopy and 
neutralization assay 

The following cell lines were used for virus isolation in this study: Vero 
E6 and Huh7 cells, which were cultured in DMEM containing 10% FBS. All 
celllines were tested and free of mycoplasma contamination, submitted 
for species identification and authenticated by morphological evalua- 
tion by microscopy. None of the cell lines was on the list of commonly 
misidentified cell lines (by ICLAC). 

Cultured cell monolayers were maintained in their respective medium. 
The PCR-positive BALF sample from ICU-06 patient was spun at 8,000g 
for 15 min, filtered and diluted 1:2 with DMEM supplemented with 
16 pg mI trypsin before it was added to the cells. After incubation at 
37 °C for 1h, the inoculum was removed and replaced with fresh culture 
medium containing antibiotics (see below) and 16 pg mI trypsin. The 
cells were incubated at 37 °C and observed daily for cytopathogenic 
effects. The culture supernatant was examined for the presence of virus 
by qRT-PCR methods developed in this study, and cells were examined 
by immunofluorescence microscopy using the anti-SARSr-CoV Rp3 N 
antibody that was generated in-house (1:1,000). Penicillin (100 units mI“) 
and streptomycin (15 jg mI’) were included inall tissue culture media. 

Vero E6 cells were infected with the new virus at a multiplicity of 
infection (MOI) of 0.5 and collected 48 h after infection. Cells were fixed 
with 2.5% (w/v) glutaraldehyde and 1% osmium tetroxide, dehydrated 
through a graded series of ethanol concentrations (from 30 to 100%) 
and embedded with epoxy resin. Ultrathin sections (80 nm) of embed- 
ded cells were prepared, deposited onto Formvar-coated copper grids 
(200 mesh), stained with uranyl acetate and lead citrate, and analysed 
using a200-kV Tecnai G2 electron microscope. 

The virus neutralization test was carried out in a 96-well plate. 
The patient serum samples were heat-inactivated by incubation at 
56 °C for 1h before use. The serum samples were diluted to 1:10, 1:20, 
1:40 or 1:80, and then an equal volume of virus stock was added and 
incubated at 37 °C for 60 min in a 5% CO, incubator. Diluted horse 
anti-SARS-CoV serum or serum samples from healthy individuals 
were used as control. After incubation, 100 pl mixtures were inocu- 
lated ontoa monolayer of Vero E6 cells in a 96-well plate for 1h. Each 
serum wasassessed in triplicate. After removing the supernatant, the 
plate was washed twice with DMEM medium. Cells were incubated 
with DMEM supplemented with 2% FBS for 3 days. Subsequently, 
the cells were checked for cytopathogenic effects. 


RNA extraction and PCR 

Whenever commercial kits were used, the manufacturer’s instructions 
were followed without modification. RNA was extracted from 200 pl 
of samples with the High Pure Viral RNA kit (Roche). RNA was eluted in 
50 pl of elution buffer and used as the template for RT-PCR. 

For qPCR analysis, primers based on the S gene of 2019-nCoV were 
designed: RBD-qF1, 5’-CAATGGTT TAACAGGCACAGG-3’; RBD-qRI1, 
5’-CTCAAGTGTCTGTGGATCACG-3’. RNA extracted as described 
above was used for qPCR using the HiScript II One Step (RT-PCR 
SYBR Green Kit (Vazyme Biotech). Conventional PCRs were also 
performed using the following primer pairs: ND-CoVs-951F, 5’-TGT- 
KAGRTTYCCTAAYAT TAC-3’; ND-CoVs-1805R, 5’-ACATCY TGATAN- 
ARAACAGC-3’. The 20-pl qPCR reaction mix contained 10 pl 2x One 
Step SYBR Green mix, 1 p] One Step SYBR Green Enzyme mix, 0.4 pl 
50x ROX Reference Dye 1, 0.4 pl of each primer (10 LM) and 2 ul 
template RNA. Amplification was performed as follows: 50 °C for 
3 min, 95 °C for 30s followed by 40 cycles consisting of 95 °C for10s 
and 60 °C for 30s, and a default melting curve step in an ABI 7500 
Real-time PCR machine. 


Serological test 

In-house anti-SARSr-CoV IgG and IgM ELISA kits were developed using 
SARSr-CoV Rp3 N protein as antigen, which shared more than 90% 
amino acid identity to all SARSr-CoVs?. For IgG analyses, MaxiSorp 
Nunc-immuno 96-well ELISA plates were coated (100 ng per well) over- 
night with recombinant N protein. Human serawere used at a dilution 
of 1:20 for1hat37 °C. An anti-human IgG HRP-conjugated monoclonal 
antibody (Kyab Biotech) was used at a dilution of 1:40,000. The OD 
value (450-630 nm) was calculated. For IgM analyses, MaxiSorp Nunc- 
immuno 96-well ELISA plates were coated (500 ng per well) overnight 
with anti-human IgM (yp chain). Human sera were used at a 1:100 dilu- 
tion for 40 min at 37 °C, followed by incubation with an anti-Rp3 N 
HRP-conjugated antibody (Kyab Biotech) at a dilution of 1:4,000. The 
OD value (450-630 nm) was calculated. 


Examination of ACE2 receptor for 2019-nCoV infection 

HeLacells transiently expressing ACE2 were prepared using Lipofectamine 
3000 (Thermo Fisher Scientific) ina 96-well plate; mock-transfected cells 
were used as controls. 2019-nCoV grown in Vero E6 cells was used for 
infection at a MOI of 0.5. APN and DPP4 were analysed in the same way. 
The inoculum was removed after absorption for 1 h and washed twice 
with PBS and supplemented with medium. At 24 h after infection, cells 
were washed with PBS and fixed with 4% formaldehyde in PBS (pH 7.4) 
for 20 min at room temperature. ACE2 expression was detected using 
amouse anti-S tag monoclonal antibody anda FITC-labelled goat anti- 
mouse IgGH&L (Abcam, ab96879). Viral replication was detected using 
arabbit antibody against the Rp3 N protein (generated in-house, 1:1,000) 
and aCy3-conjugated goat anti-rabbit IgG (1:200, Abcam, ab6939). Nuclei 
were Stained with DAPI (Beyotime). Staining patterns were 
examined using confocal microscopy onaFV1200 microscope (Olympus). 


High-throughput sequencing, pathogen screening and genome 
assembly 

Samples from patient BALF or from the supernatant of virus cul- 
tures were used for RNA extraction and next-generation sequencing 
(NGS) using BGI MGISEQ2000 and Illumina MiSeq 3000 sequencers. 
Metagenomic analysis was carried out mainly based on the bio- 
informatics platform MGmapper (PE_2.24 and SE _2.24). The raw 
NGS reads were first processed by Cutadapt (v.1.18) with minimum 
read length of 30 base pairs. BWA (v.0.7.12-r1039) was used to align 
reads to a local database with a filter hits parameter of 0.8 FMM 
((match + mismatch)/read length = fraction] value and minimum 
alignment score of 30. Parameters for post-processing of assigned 
reads were set toa minimum size normalized abundance of 0.01, 


minimum read count of 20 and were otherwise set to default param- 
eters. A local nucleic acid database for human and mammals was 
used to filter reads of host genomes before mapping reads to the 
virus database. The results of the metagenomic analysis were dis- 
played as pie charts using Microsoft Office 2010. NGS reads were 
assembled into genomes using Geneious (v.11.0.3) and MEGAHIT 
(v.1.2.9). PCR and Sanger sequencing was performed to fill gaps in 
the genome. 5’-rapid amplification of cDNA ends (RACE) was per- 
formed to determine the 5’-end of the genomes using a SMARTer 
RACE 5’/3’ kit (Takara). Genomes were annotated using the Clone 
Manager Professional Suite 8 (Sci-Ed Software). 


Phylogenetic analysis 

Routine sequence management and analysis was carried out using 
DNAStar. The sequence alignment of complete genome sequences 
was performed using MAFFT (v.7.307) with default parameters. The 
codonalignments of full-length S and RdRp gene sequences were con- 
verted from the corresponding protein alignments by PAL2NAL (v.14); 
the protein alignments were created by Clustal Omega (v.1.2.4) using 
default parameters. Maximum likelihood phylogenetic trees were 
generated using RAXML (v.0.9.0) with GTR+G substitution model and 
1,000 bootstrap replicates. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| NGS raw reads of sample WIV04 mapping to the 2019-nCoV sequence. The x axis indicates the genome nucleotide position and they axis 
represents the read depth of the mapping. 
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Extended Data Fig. 2 | Phylogenetic trees based onthe complete S and RdRp 
gene sequences of coronaviruses. a, b, Phylogenetic trees on the basis of the 
gene sequences of S (a) and RdRp (b) are shown. 2019-nCoV and bat CoV RaTG13 
areshown in bold and in red. Thetrees were constructed using the maximum 
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Extended Data Fig. 3 | Aminoacid sequence alignment of theS1 proteinof sequences. The short insertions in the N-terminal domain of the 2019-nCoV are 
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are indicated by the red box. The key amino acid residues involved in the Zhejiang province. 
interaction with human ACE2 are numbered at the top of the aligned 
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Extended Data Fig. 4| Molecular detection method used to detect 
2019-nCoV.a, Standard curve for qPCR primers. The PCR product of theS gene 
that was serial dilutedin the range of 10° to 101 (lines from left to right) was used 
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as atemplate. Primer sequences and experimental conditions are described in 
the Methods. b, Specificity of the qPCR primers. Nucleotide samples from the 
indicated pathogens were used. 
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Extended Data Fig. 5 | Amino acid sequence alignment of the nucleocapsid protein of 2019-nCoV to bat SARSr-CoV Rp3 and SARS-CoV BJO1. Bat SARSr-CoV 
Rp3 was obtained fromR. sinicus, whichis found in Guangxi province. 
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Extended Data Fig. 6 | Isolation and antigenic characterization of 
2019-nCovV. a, b, Vero E6 cells are shown at 24 h after infection with mock virus 
(a) or 2019-nCoV (b).c, d, Mock-virus-infected (c) or 2019-nCoV-infected (d) 
samples were stained with rabbit serum raised against recombinant SARSr-CoV 
Rp3 N protein (red) and DAPI (blue). The experiment was conducted twice 
independently with similar results. e, The ratio of the number of reads related 


to 2019-nCoV among the total number of virus-related reads in metagenomics 
analysis of supernatants from Vero E6 cell cultures. f, Virus growth in Vero E6 
cells. g, Viral particles in the ultrathin sections were imaged using electron 
microscopy at 200 kV. The sample was from virus-infected Vero E6 cells. The 
inset shows the viral particles in an intra-cytosolic vacuole. 
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Extended Data Fig. 7 | Analysis of 2019-nCoV receptor usage. Determination detected using mouse anti-S tag monoclonal antibody. ACE2, APN and DPP4 
of virus infectivity in HeLa cells with or without the expression of human APN proteins (green), viral protein (red) and nuclei (blue) are shown. Scale bars, 
and DPP4. The expression of ACE2, APN and DPP4 plasmids with S tag were 10 um. 


Extended Data Table 1| Patient information and diagnosis history 


Patient No. Gender Age Date of Onset Date of Admission Symptoms When Admitted Current Status (2020.01.13) Diagnosis history 
ICU-01* Male 62 2019.12.12 2019.12.27 fever recover, discharged negative 
ICU-04 Male 32 2019.12.19 2019.12.29 fever, cough, dyspnea fever, intermittent cough negative 
ICU-05 Male 40 2019.12.17 2019.12.27 fever (38 °C), expectoration, malaise, dyspnea _ fever, malaise, intermittent cough AdV (IgM) 
ICU-06 Female 49 2019.12.23 2019.12.27 fever (37.9 °C), palpitation fever, malaise, cough Coronavirus (nt) 
ICU-08 Female 52 2019.12.22 2019.12.29 fever (38.5 °C), expectoration, malaise, dyspnea recover, discharged Streptococcus pneumoniae (nt) 
ICU-09 Male 40 2019.12.22 2019.12.28 fever (38.5 °C), expectoration fever (38.5 °C), malaise, expectoration, dizziness negative 
ICU-10 Male 56 2019.12.20 2019.12.20 fever, dyspnea, chest tightness fever, malaise, cough, dyspnea negative 


Note, some records are missing. All patients are sellers or deliverymen at the seafood market except ICU-01, whose contact history is unclear. All patients were admitted to intensive care unit 
(ICU) during the first investigation and were now in stable condition. Blood IgM tests have been performed for the following respiratory pathogens for all patients: Legionella pneumophilia, 
Mycoplasma pneumoniae, Chlamydia pneumoniae, respiratory syncytial virus, adenovirus, Rickettsia, influenza A virus, influenza B virus and parainfluenza virus. 

*This patient reported fever on 12 December 2019 and then recovered without medical treatment. He came back to the hospital on 27 December 2019 with a fever. His wife was also ill and 


admitted to the hospital. Both individuals recovered. 
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Extended Data Table 2 | Laboratory results 


First sampling-2019.12.30 Second sampling-2020.01.10 
Patient No. TestNo. BALF OralSwab Blood(Ab) OralSwab  AnalSwab  Blood(PCR) Blood (Ab) 
ICU-01 WIv0l - Ct=32.0 NA NA NA NA NA 
ICU-04 WIV02* Ct=17.6 Ct=26.6 NA - - - + 
ICU-05 WIV03) -Ct=27.0 = Ct=31.9 NA - - - + 
ICU-06 WIV04" = Ct=18.3. Ct=27.7 + 2 2 : + 
ICU-08 WIV0S* = Ct=24.1 - NA NA NA NA NA 
ICU-09 WIV06* §=Ct=21.6 Ct=29.4 NA - - - + 
ICU-10 WIVO7* = Ct=25.7. Ct=24.0 NA - - - + 


Samples from two patients (ICU-01 and ICU-08) were not available during the second investigation. They had been discharged from the hospital. We did a serial test for patient ICU-06 on the 
following dates: 30 December 2019, 31 December 2019, 1 January 2020 and 10 January 2020, corresponding to 7, 8, 9 and 18 days after disease onset (23 December 2019). Molecular and 
serological (IgM and IgG) virus-detection results for 2019-nCoV are shown. NA, not available. 

*Virus isolated. 

*A full-length genome was obtained. 


Extended Data Table 3 | Genomic comparison of 2019-nCoV WIV0O4 with SARS-CoVs and bat SARSr-CoVs 
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76.3/68.9 
75.8/67.2 
76.3/68.9 
76.3/68.9 
75.8/68.9 
98.4/100 
78.0/73.8 
TIA/72.1 
78.0/70.5 
76.9/67.2 
76.9/70.5 
78.5/70.5 
95.2/93.4 
95.2/93.4 
76.3/68.9 
77.4/68.9 
77.4/68.9 
80.0/71.8 
78.5/70.5 
76.9/68.9 
79.0/68.9 
75.3/67.2 


ORF7a 


82.8/86.0 
82.8/86.0 
82.8/86.0 
82.8/86.0 
82.8/86.0 
95.6/97.5 
85.0/88.4 
85.0/88.4 
84.4/88.4 
85.0/88.4 
83.1/87.6 
82.0/85. 
88.8/87.6 


89.1/88.4 
84.2/87.6 
85.0/86.8 
84.2/86.8 
85.2/87.6 
84.7/88.4 
83.9/89.3 
84.2/86.8 
85.0/89.3 


Sequence identities with SARS-CoVs & bat SARSr-CoVs (nt/aa %) 


ORF7b 


84.8/81.4 
84.8/81.4 
84.8/81.4 
84.8/81.4 
84.8/81.4 
99.2/97.7 
85.6/83.7 
85.6/83.7 
85.6/83.7 
85.6/83.7 
86.4/83.7 
81.1/81.4 
94.7/93.0 
95.5/93.0 
85.6/83.7 
84.1/79.1 
85.6/83.7 
84.8/83.7 
85.6/83.7 
84.8/83.7 
84.1/83.7 
84.1/79.1 


ORF8 


52.0/31.6 
SL.1/- 
SL.1/- 
52.3/31.6 
52.3/- 
97.0/95.0 
65.8/57.9 
65.3/57.9 
65.8/58.7 
65.3/57.9 
50.3/31.3 
66.7/57.9 
88.5/94.2 
88.5/94.2 
62.0/49.6 
51.4/31.6 
49.7/30.4 
66.1/55.4 
65.8/58.7 
66.4/56.2 
50.6/31.3 
66.4/57.0 


N 


87.7/91.2 
88.8/91.2 
88.8/91.2 
88.8/91.2 
88.5/90.7 
96.9/99.0 
88.5/90.9 
88.6/90.9 
88.6/90.9 
88.8/91.4 
88.3/90.5 
89.0/91.6 
91.1/94.3 
91.2/94.3 
88.9/91.6 
87.9/90.2 
86.9/90.2 
88.2/91.2 
87.9/91.2 
88.4/90.7 
86.8/89.5 
88.3/90.0 
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Extended Data Table 4 | Virus neutralization test of serum samples 


Samples VNT titre for nCoV-2019 
Healthy people #1 from Wuhan neg 

Healthy people #2 from Wuhan neg 

Horse anti-SARS-CoV serum >1:80 


WIV02 >1:80 
WIV03 1:40 

WIV04 >1:80 
WIV06 >1:80 
WIV07 >1:80 


Each serum sample was tested in triplicate. Serum samples from two healthy individuals from Wuhan and five patients as well as a horse anti-SARS-CoV anti-serum were used. 
We used 120 TCID;, viruses per well. Serum samples were used at dilutions of 1:10, 1:20, 1:40 and 1:80. neg, negative; VNT, virus neutralization test. 
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Sample size Samples of seven pneumonia patients are available from the clinical hospital to be sent to Wuhan Institute of Virology for pathogen 
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infected by the same virus. Therefore, the sample size is sufficient for conducting the following study which aims to identify and characterize 
the causative agent of this pneumonia outbreak. 
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Ltd,Wuhan, China, dilution: 1:40000); 3. Anti-Rp3 NP-HRP conjugated (Kyab Biotech Co.,Ltd, Wuhan, China, dilution: 1:4000); 
4.FITC-labelled goat anti-mouse IgG H&L (Abcam, ab96879, dlilution 1:100); 5. cyanin 3-conjugated goat anti-rabbit IgG 
(Abcam,ab69339, dilution: 1:200); 6. mouse anti-S tag monoclonal antibody made in house 


The house-made SARSr-CoV Rp3 NP antibodies and anti-S tag monoclonal antibody were validated in a WB. The cy3-conjugated 
anti-rabbit IgGs were validated in IFA. The FITC-labelled goat anti-mouse IgG H&L was validated in IHC. 
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ATCC. 


All monkey and human cells were from ATCC with authentication. The authentication was performed by microscope 
morphology check, growth curve analysis or identity verification with STR analysis (for human cell lines). 
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Despite the resounding clinical success in cancer treatment of antibodies that block 
the interaction of PD1 with its ligand PDL1', the mechanisms involved remain 
unknown. A major limitation to understanding the origin and fate of T cells intumour 
immunity is the lack of quantitative information on the distribution of individual 
clonotypes of T cells in patients with cancer. Here, by performing deep single-cell 
sequencing of RNA and T cell receptors in patients with different types of cancer, we 
survey the profiles of various populations of T cells and T cell receptors intumours, 
normal adjacent tissue, and peripheral blood. We find clear evidence of clonotypic 
expansion of effector-like T cells not only within the tumour but also innormal 
adjacent tissue. Patients with gene signatures of such clonotypic expansion respond 
best to anti-PDL1 therapy. Notably, expanded clonotypes found in the tumour and 
normal adjacent tissue can also typically be detected in peripheral blood, which 
suggests aconvenient approach to patient identification. Analyses of our data 
together with several external datasets suggest that intratumoural T cells, especially 
inresponsive patients, are replenished with fresh, non-exhausted replacement cells 
from sites outside the tumour, suggesting continued activity of the cancer immunity 


cycle in these patients, the acceleration of which may be associated with clinical 


response. 


We sequenced 330 million mRNA transcripts in 141,623 T cells from 14 
treatment-naive patients across four different types of cancer (Extended 
Data Fig. 1a—i, Supplementary Tables 1,2). For each patient, samples were 
obtained from surgically resected tumour and histologically normal 
adjacent tissue (NAT), and, in four patients, from peripheral blood. 
Single-cell T cell receptor (TCR) sequencing (scTCR-seq) yielded one or 
more complementarity-determining region 3 (CDR3) regions of a- or 
B-chains in 99,788 cells. T cells were grouped into 56,975 distinct clono- 
types by matching CDR3 regions, which allowed us to measure clonal 
expansion and track clonal lineages across tissues. Although individual 
clonotypes were rarely shared across patients (Supplementary Table 3), 
many clonotypes resided in both the tumour and the corresponding NAT 
from agiven patient (Extended Data Fig. 2a). Such dual-expanded clones 
contrasted with expanded clones that resided solely in one compart- 
ment (termed tumour or NAT multiplets), and with clones having only 
one observed cell in either compartment (singletons). 

Although most clonotypes were singletons (Extended Data Fig. 2b), 
representing a diverse repertoire of T cells, 9-18% of clonotypesin each 


patient were clonally expanded multiplets or dual-expanded clones. 
Dual-expanded clonotypes constituted substantial fractions of observed 
cells, sometimes the majority of T cells ina given compartment (Extended 
Data Fig. 2c). Analysing clones by their cell counts in NAT and tumour 
samples revealed diverse clonal expansion behaviours across patients 
(Extended Data Fig. 2d). In some patients—for example, Colonl— 
dual-expanded clones aligned along the main diagonal, reflecting 
approximately equal cell countsin the two compartments and suggest- 
ing coordinate or parallel processes intumour and NAT, brought about 
by multiple clonotypes acting similarly. Conversely, in other patients, 
dual-expanded clones werescattered from the main diagonal, indicating 
that each clone had an independent, divergent pattern of expansion, 
migration or attrition in the tumour and NAT compartments. 

We hypothesized that parallel expansionin tumour and NAT might 
represent infiltration of T cells from the periphery, extravasating 
into both tissues equally from inflamed blood vessels”. We found 
supporting evidence from matched blood samples, in which clones 
highly expanded in blood showed evidence of parallel expansion in 
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Tissue infiltration. Bar plots show distributions of cells 
by tissue expansion patterns from blood-independent 
(Ind), non-expanded (Non) and expanded clones (Exp). 
Values for dual-expanded clones (D) are shown. Pvalues 
were determined by achi-square test on counts of tissue- 
resident cells (those in tumour or NAT but notin blood). 
e, Detection of tissue TCRs in blood. Bar plotsshow 
fractions of tissue-resident cells with clonotypes 
observed ina blood-expanded clone, for each tissue 
expansion pattern (values for D are shown).n, NAT 
singleton; N, NAT multiplet; t, tumour singleton; 
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tumour and NAT (Fig. 1a). To quantify our observations in these four 
patients, we classified clones not only by their expansion in tissue 
but also by their peripheral expansion, as measured by clone sizes in 
blood (Fig. 1b, Extended Data Fig. 2e). Clones with single or multiple 
constituent cells in blood were termed blood non-expanded and blood- 
expanded, respectively, whereas clones without any associated cells 
inblood were termed blood-independent. With this schema, we found 
that an aggregate measure of peripheral expansion over all clones—as 
determined by the fraction of blood T cells from blood-expanded 
versus non-expanded clones (Fig. 1c)—associated with higher levels 
of lymphocytic infiltration by blood-expanded clones into tumour 
and NAT samples, and served as a proxy for the extent of dual expan- 
sion (Fig. 1d). Notably, in patients with strong dual expansion, dual- 
expanded clones in tissue could frequently be detected as expanded 
in blood (Fig. 1e). 

Our scTCR-seq data allowed us to make observations at the level 
of clones rather than individual T cells. By studying clones, we found 
that peripheral and intratumoural clone sizes were significantly cor- 
related (P= 6.3 x 10°, P=8.2 x 10° and P= 0.0038) in our dataset 
as well as in two external datasets** (Fig. 1f-h), which indicates that 
the relationship between peripheral expansion and tumour infiltra- 
tion held not only for aggregate cell fractions but also for individual 
clones. More notably, a general and significant (P= 5.5 x 10 *) associa- 
tion was observed across patients in the combined datasets between 
the total amount of peripheral expansion and parallel dual expansion 
(Fig. li, Extended Data Fig. 3). Collectively, our results show a strong 


tumour and blood. i, Peripheral expansion and parallel 
dual expansion. Values aresummarized for patientsin f- 
h. Pvalues were determined by atwo-tailed linear 
regression ¢test, after exponentiating y values to 
linearize the data, across patients. 


0.2 0.4 0.6 08 
Peripheral expansion 


relationship between peripheral clonal expansion and parallel infiltra- 
tion into tumours and NAT. 

We explored the phenomenon of clonal expansion further by simul- 
taneously measuring gene expression using single-cell RNA sequenc- 
ing (ScCRNA-seq). Transcriptional profiles of individual T cells allowed 
grouping of similar cells into clusters (Fig. 2a), which we characterized 
by cross-labelling our cells against reference gene signatures from 
published datasets? > (Extended Data Fig. 4a), and by profiling genes 
we found tobe cluster-specific (Extended Data Fig. 4b, Supplementary 
Table 4). Cluster analysis identified T effector cells (8.1-T,,,), T effector 
memory cells (8.2-T,..,), and regulatory T cells (4.6a-T,.g, 4.6b-T,.,), aS 
well as subtypes or physiological states of resident memory T (T,,,) 
cells®, which shared abundant expression of /TGAE (also known as 
CD103) and ZNF683 (Hobit), a regulator of tissue residency’, as well as 
positive scores on a published gene signature of T,,, cells® (Extended 
Data Fig. 4c). These subclusters, designated 8.3a-T,,,, 8.3b-T,,, and 8.3c- 
Tim differed by low, intermediate and high expression of activation- 
exhaustion markers, respectively, including PD1 (also known as PDCD1) 
(Extended Data Fig. 4d), and also by a published gene signature for ter- 
minally exhausted versus stem-like exhausted CD8' T cells’ (Extended 
Data Fig. 4e), consistent with recently identified phenotypes of pro- 
genitor exhausted and terminally exhausted T,,, cells!°. Subcluster 
8.3c had lower expression of activation marker XCL1 (which encodes 
lymphotactin), while 8.3b had exclusively high expression of KLRC2 
(also known as NKG2-C) and KLRC3 (NKG2-£) (Extended Data Fig. 4b), 
observed in CD8* T cells with both TCR-dependent and innate-like, 
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Fig. 2|T cell clusters and clonal expansion. a, Cluster analysis. T cells fromall 
14 patients are plotted by uniform manifold approximation and projection 
(UMAP) dimensionality reduction of the scRNA-seq data, and coloured by 
cluster assignment, with cluster regions approximated by labelled ovals. The 
curved line highlights the separation of CD8 (left) and CD4 (right) T cells. 

b, Tissue expansion patterns by cluster. T cells intumour and NAT witha 
clonotype are plotted by UMAP coordinates from aand coloured by tissue 
expansion pattern of their parent clone using a two-dimensional palette. 

c, Clonal expansion by cluster and blood expansion pattern. Clones from Fig. la 
are plotted by blood independence, non-expansion and expansion (rows) and 
by primary cluster (columns), using coordinates for NAT and tumour clone size, 
and dot size for blood clone size. Grey lines separate NAT and tumour clone 
sizes of zero and non-zero. Two-sided Pvalues are shown when P<0.01 froma 
Pearson’s correlation coefficient ron log-transformed clone sizes for dual- 
expanded clones. d, Analysis of 8.1-T.;;clones. Scatter plots from c for 8.1-T.,;for 
blood-independent, non-expanded and expanded clones are shown as 
distributions of clone sizes in tumour plus NAT for each tissue expansion 
pattern, abbreviated as in Fig. 1b. Vertical jitter distinguishes overlapping 
points. Two-sided Pvalues are shown for Bonferroni-adjusted P< 0.01 froma 
t-test on log-transformed clone sizes for dual-expanded clones. 


TCR-independent mechanisms of activation". In addition, subclus- 
ters 8.3a to 8.3c showed increasing prevalence in tumour versus NAT 
(Extended Data Fig. 4f), supporting a relationship between PD1 expres- 
sion, terminal exhaustion, and increased exposure to cognate antigens 
within the tumour microenvironment”. 

In addition, our deep sequencing analysis revealed several clusters 
of T cells not matching published reference gene signatures (Extended 
Data Fig. 4a). We characterized these clusters using gene set enrich- 
ment analysis, which showed active histone modification and mitosis 
inclusters 8.4.and 8.5, respectively (Extended Data Fig. 4g). Chromatin 
regulation in 8.4 was further evidenced by high expression of the long 
noncoding RNA MALATI® and the chromatin remodelling enzyme 
CHDI* (Extended Data Fig. 4b). Cluster 3.1 expressed abundant mito- 
chondrial genes, suggestive of apoptosis, and we set this cluster aside as 
generic CD3‘ cells. Clusters 8.4-Chrom and 8.5-Mitosis had similarities 
toT,,, cells, with cells expressing a range of /TGAEF levels (Extended Data 
Fig. 4b) andthe highest mean expression of PDI (Extended Data Fig. 4d), 
generally considered a marker of exhaustion. Moreover, 8.5-Mitosis 
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alsocross-labelled most closely with clusters that have been reported 
torepresent exhausted T cells, specifically, the CD8_ex cluster observed 
by Yost et al. and the CD8_C6-LAYN cluster observed by Guo et al.? 
(Extended Data Fig. 4a). However, neither 8.4-Chrom nor 8.5-Mitosis 
scored highly on the published signature for T,,, cells, which are thought 
to be the main source of exhausted T cells (Extended Data Fig. 4c). 

Combining scTCR-seq and scRNA-seq data yielded insights into the 
clonal expansion behaviour of clones and T cells. One challenge in such 
combined analysis is that whereas expansion behaviour is attributable 
toclones, clusters are assigned to individual cells based ontheir tran- 
scriptional profiles. Therefore, a given clone may comprise a mixture 
of T cell phenotypes with distinct transcriptional profiles and hence 
different clusters. Accordingly, clones in our dataset exhibited arange 
of transcriptional diversity: some clones were homogeneous for a sin- 
gle cluster, whereas others had heterogeneous cellular mixtures of 
several clusters (Extended Data Fig. 4h, i). Nevertheless, most clones 
hadaprimary cluster that predominated and served as an approximate 
phenotype for subsequent analysis. 

As expected, clones partitioned into CD8 or CD4 types, which exhib- 
ited distinct tissue expansion behaviours (Extended Data Fig. 5a, b). 
Clones of primarily 8.1, 8.2 and 8.3 T cells were largely dual-expanded 
(Fig. 2b), whereas clones of CD4* T cells were generally singletons, 
with some exceptions: (1) low-level dual expansion by 4.4-FOS clones; 
(2) tumour and NAT multiplet expansion by clones of 4.1-T,,, cells; 
and (3) tumour multiplet expansion by 4.5-IL6ST and 4.6-T,.g clones 
(Extended Data Fig. 5c). Clusters 8.4, 8.5 and 8.6 showed very little 
clonal expansion intissue. Our analysis of external datasets** yielded 
similar results, with the analogous CX3CR1and GZMK clusters showing 
similar patterns of clonal expansion to our 8.1-T,,,and 8.2-T,,, clusters, 
respectively (Extended Data Fig. 5d, e). 

When wemapped primary clusters onto clones from our four patients 
with blood samples (shown previously in Fig. 1a), we observed that 
many clones exhibiting parallel dual expansion were 8.1-T,,, clones 
(Extended Data Fig. 6a). To investigate this association, we re-grouped 
clones from the patients according to their blood expansion patterns 
and primary cluster (Fig. 2c), and found that blood-expanded clones 
witha primary cluster of 8.1-T,,,had a significant (P=9 x10“) correla- 
tion between tumour and NAT clonesizes. In addition, 8.3b-T,,, clones, 
possibly containing T cells with TCR-independent activation, showed 
lower levels of parallel dual expansion. Nevertheless, 8.1-T,;,clones were 
morenumerous and were the only ones to showastatistically significant 
(P=1.4 x 10° and P=3.0 x 10°) relationship between blood expan- 
sion and dual-expanded clone size (Fig. 2d, Extended Data Fig. 6b), 
indicating their predominant role in peripheral clonal expansion and 
tumour infiltration. 

Blood-expanded 8.1-T,,, clones may have a role not only in tumour 
infiltration but also in differentiation into other T cell phenotypes”. 
Although blood-expanded 8.1-T,,,clones constituted only a small frac- 
tion of all clones (Extended Data Fig. 6c, d), they hada disproportion- 
ate influence on T cell composition in blood and tumours. By linking 
individual T cell phenotypes with the blood expansion patterns of 
their parent clones, we observed that 8.2-T,,,, and 8.3-T,,, cells in blood 
(Extended Data Fig. 6e, f) and tumours (Extended Data Fig. 6g, h) shared 
clonotypes extensively with blood-expanded clonal lineages of 8.1-Ter¢ 
cells. Thus, differentiation of blood-expanded 8.1-T,,,clones may yield 
a diverse array of T cell subtypes in blood and tumours. By contrast, 
most 8.2-T,,, and 8.3-T,,, cells in NAT belonged to blood-independent 
clones (Extended Data Fig. 6i, j), consistent with mechanisms of local 
expansion. 

Our datacapture onlya singlesnapshot of clone sizes across tumour, 
NAT and blood, representing the accumulation over time of various 
dynamic processes. To understand these dynamics, we analysed an 
external dataset’ of pre- and post-treatment T cells from basal cell 
carcinomas, with the understanding that the two sets of measurement 
differ both by the effects of immunotherapy and bythe passage of time. 


The published analysis demonstrated that new clones appear after 
treatment, with clonotypes distinct from those detected before treat- 
ment. We extended the analysis by matching novel clones in tumours 
after treatment with pre-existing clones in blood before treatment, as 
assayed by bulk TCR sequencing (TCR-seq) (Extended Data Fig. 7a). We 
found significant (P= 2.3 x10 Sand P=1.3 x 10°) correlations in clone 
size and transcript counts among novel CD8* cells (Fig. 3a, Extended 
Data Fig. 7b) but not novel CD4* cells (Extended Data Fig. 7c), which 
suggests that at least some of the new CD8* clones originated periph- 
erally. Moreover, although patients differed substantially in number 
and expansion of new clones, these differences associated with the 
degree of clonal diversity in peripheral blood (Fig. 3b, Extended Data 
Fig. 7d), further supporting a relationship between peripheral and 
intratumoural clonal expansion. 

We further categorized new tumour clones based on whether they 
shared TCR sequences with blood before treatment, with the caveat 
that incomplete sampling of TCRs in blood (Extended Data Fig. 7e) 
may misidentify some clones as being blood-independent. Because 
bulk TCR-seq measures total TCR transcripts rather than cells with 
a given TCR, we could not discriminate blood-expanded from non- 
expanded clones, but we could distinguish blood-independent from 
blood-associated clones by the absence or presence of a matching 
bulk TCR-seq transcript. Notably, in the two patients with greatest 
peripheral clonal diversity, we observed a statistically significant 
(P=3.8 x10“ and P=2.2 x10") interaction between blood association 
and exhaustion phenotype (Fig. 3c): non-exhausted clones were more 
likely to be blood-associated, whereas exhausted clones were more 
likely to be blood-independent. Furthermore, non-exhausted clones 
that associated with blood had significantly (P=3.1x 10, P=5.2x 10 
and P=2.0 x10“) greater clone sizes in tumours after treatment than 
those independent of blood. Although patients available for study 
were limited, our observations suggest two possible modes of clonal 
expansion: (1) alocal expansion independent of blood (or possibly 
stemming from peripheral events in the remote past) that generates 
exhausted T cells; and (2) an infiltrative process from contemporary or 
relatively recent blood content that introduces novel, non-exhausted 
clones into the tumour. 

The prevalence of infiltrating T cells, which may be reactive not only 
tothetumour butalsotonon-tumour antigens, could account for recent 
observations of ‘bystander’ T cells that are reactive against non-tumour 
antigens” and might help explain recent findings that only a small 
fraction of intratumoral CD8* T cells are reactive against tumour’. We 
compared our CDR3 amino acid sequences against databases of CDR3 
sequences that are known or likely to react to common viral antigens. 
Although TCRs often matched those corresponding to common viral 
antigens, almost all ofthese TCRs belonged to singletonclonesandthere- 
fore showed no evidence of clonal expansion. A few TCRs in each patient 
did exhibit high levels of clonal expansion (Extended Data Fig. 8a), 
which suggests that although bystander T cells may be prevalent at 
the cellular level, relatively few clones may be responding to non- 
tumour antigens. We observed over-representation of virally matching 
TCRs among dual-expanded clonotypes (P= 3.7 x 10° and P=0.0018) 
(Extended Data Fig. 8b) and various CD8 clusters (Extended Data 
Fig. 8c), which is consistent with the infiltration of clonally expanded 
bystander T cells from the periphery. 

As our analysis indicates, patients vary in their extent of peripheral 
clonal expansion and resulting infiltration of T cells, and this variability 
could potentially explain differing clinical responses to immune check- 
point blockade. We evaluated bulk RNA-seq tumour samples from three 
randomized phase II trials of the anti-PDL1 antibody atezolizumab'*° 
using gene signatures of tissue expansion patterns involving tumour tis- 
sue: tumour singletons, tumour multiplets, and dual-expanded clones 
(Extended Data Fig. 9a, b, Supplementary Table 5). Because clonal 
expansion occurs primarily with CD8* T cells, these gene signatures 
correlated highly with CD8A expression (Extended Data Fig. 9c)—an 
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Fig. 3| Peripheral clonal expansion and novel intratumoural clones. 

a, Correlated clone sizes intumour and blood. Scatter plots of tumour clone 
size after treatment (post-rx) and transcript counts in blood before treatment 
(pre-rx) are shown for the two patients in Yost et al.° with bulk TCR-seq blood 
samples and the largest numbers of novel clones. Vertical lines separate blood- 
independent (left) and blood-associated (right) clones. Clones are coloured by 
primary cluster from the original analysis, as inc. Two-sided Pvalues are shown 
from Pearson’s correlation coefficients ron log-transformed values from novel 
blood-associated CD8 clones. b, Blood clonal diversity and novel clones. Data 
from Extended Data Fig. 7b, d are summarized, showing the total numbers of 
novel CD8 clones (top) and cells (bottom) after treatment across patientsasa 
function of blood clonal diversity (average of pre-and post-treatment 
measurements). Two-sided P values are shown froma linear regression t-test 
across patients, onthe logarithm of yvalues to linearize the data. c, Non- 
exhausted newclones and blood association. Novel clones in Yost et al.°are 
separated by pre-treatment blood association (Blood’) or blood independence 
(Blood), and further divided by exhaustion or non-exhaustion of their post- 
treatment primary cluster, as defined and coloured inthe legend. Patients are 
ordered by increasing blood clonal diversity. P values (top) are froma chi- 
square test on counts of new CD8 clones. P values (side) are from atwo-sided 
t-test on log-transformed clone sizes (shown if P< 0.01). 


indicator of intratumoural prevalence of CD8* T cells and a reported 
predictor of response to immunotherapy” (Extended Data Fig. 9d). 
Nevertheless, the potential exists for improving on CDS8A expression 
as a predictor, exemplified by the slightly stronger association of pro- 
gression-free survival (PFS) with the expression of CCL5 (also known 
as RANTES)—a marker of T cell activation” that ranked highly in both 
tumour multiplet and dual-expansion signatures—in treatment arms 
inwhich atezolizumab is active (with monotherapy inthe IMmotion150 
trial thought to have been mitigated by immunosuppressive myeloid 
cells”°) (Extended Data Fig. 9e). 

Similarly, gene signatures for tumour multiplets and dual expansion 
were associated with greater PFS in relevant arms, both by the hazard 
ratios of individual genes (Fig. 4a) and by the survival analysis of com- 
bined signature scores (P< 0.02) (Extended Data Fig. 10a). To handle 
confounding expression of CDS8A, we performed survival analyses 
of CD8A expression and expansion signature scores as dual variates 
(Fig. 4b, Extended Data Fig. 10b). Although most patients were concord- 
ant in both measures, the relatively few discordant patients with high 
CD8A expression and low expansion signature scores were anomalous 
and therefore informative. In every atezolizumab-containing arm, 
the discordant group had worse PFS than the concordant group, with 
a hazard ratio > 1 in several cases. Furthermore, a similar dual-variate 
analysis of tumour multiplet and dual expansion signatures showed 
additive predictive power (Extended Data Fig. 10c), which suggests that 
these two types of clonal expansion may contribute independently in 
the clinical response of patients to cancer immunotherapy. 

In summary, dual-expanded T cell clones in tumours, and their deri- 
vation from peripherally expanded clones, may represent the overall 
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Fig. 4| Survival analysis of tissue expansion gene signatures. a, Gene hazard 
ratios. Each gene signature for a tissue expansion pattern involving tumour 
tissue (columns) is represented by hazard ratios in treatment arms (rows) 
after dichotomization within the clinical trial. Vertical jitter distinguishes 
overlapping dots. Hazard ratio <1 (left of vertical lines) indicates greater PFS. 
Mean hazard ratio and one-sided Pvalues are shown from aone-sample z-test 
on hazard ratios for genes inthe signature, highlighted with associated datain 
red when Bonferroni-adjusted P< 0.05. b, Survival analysis of signatures with 


strength of immune response, or the ‘set point’ of the immune system 
against tumour or foreign antigens. Our study presents an alternative 
tothe widely presumed mechanism that immune checkpoint blockade 
acts on chronically stimulated T cells toreversea terminally differenti- 
ated or ‘exhausted’ state observed originally during chronic virusinfec- 
tion’*5, Recent observations have also questioned this model, showing 
that the phenotype of terminal exhaustion is epigenetically locked 
and difficult to alter”*””. In addition, recent studies have linked clini- 
cal response to stem-like memory CD8*' T cells rather than exhausted 
ones”’, Overall, our study suggests that non-exhausted T cells and T cell 
clones supplied from the periphery may be key factors in explaining 
patient variability and clinical benefit from cancer immunotherapy. 

Clinical benefit could arise froma direct effect of PD1 or PDL1 block- 
ade oneffector T cells” or other non-exhausted T cells, or because 
blockade can increase the production or effectiveness of commit- 
ted anti-tumoural T cells only in those patients with an ongoing T cell 
response and continued replenishment of tumour-infiltrating lympho- 
cytes. As a practical consequence of this study, the observed correla- 
tions between TCR repertoires of dual-expanded clones in tumours 
and those of peripherally expanded clones suggest that sampling and 
identifying expanded clones in blood may characterize the TCR com- 
position of clinically relevant intratumoural T cells, expanding the 
possibilities for ‘liquid biopsies’*°*. 
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Methods 


Reagents and antibodies 

Antibodies used in FACS sorting were: EpCAM: Biolegend 9C4 antibody, 
cat. no.324208, lot B198775, fluorochrome APC. CD45: Biolegend 2D1 
antibody, cat. no. 368516, lot B264243, fluorochrome APC/Cy7. CD3: BD 
Biosciences SK7 antibody, cat. no. 564001, lot 6109971, fluorochrome 
BUV395. 


Software versions 

Data were collected using Cell Ranger software (10x Genomics) v.2.2.0. 
Data were analysed using Perl v.5.18.2, R v.3.6.0, and the following 
packages and versions inR for analysis: Seurat, 3.0.2; SingleR, 1.0.1°4; 
RankProd, 3.11.0*; GSEABase, 1.47.0; limma, 3.41.15**; annotate, 1.63.0; 
homologene, 1.4.68.19.3.27; mouse4302.db, 3.2.3; cocor, 1.1-3°7; iNEXT, 
2.0.19°8; and survival, 2.44-1.1°°. Two-dimensional gene expression 
maps were generated using coordinates from the UMAP algorithm*, 
as implemented in the umap-learn module v.0.3.10 in Python v.3.6.0, 
run under R using reticulate v.1.13. Figures were produced using the 
following packages and versions in R: colorspace, 1.4-17; RColorBrewer, 
1.1-2; pheatmap, 1.0.12; and superheat, 0.1.0*°. External datasets were 
obtained using GenomicDataCommons, 1.9.0“ and GEOquery, 2.53.0*. 


Human research participants 

Patient samples were procured from commercial vendors (Discovery 
Life Sciences, ispecimen Inc., Avaden BioSciences and TriMetis Life Sci- 
ences) from adult patients undergoing resection surgery. We complied 
with all ethical standards of the Roche Ethics Committee. Informed 
consent was obtained from all participants. 


Tissue dissociation 

Fresh surgical samples were separated into tumour and NAT compart- 
ments by the reviewing pathologist and shipped overnight to our insti- 
tution. Upon arrival, samples were rinsed with PBS until no traces of 
blood were visually detected. Subsequently, samples were digested 
with a combination of collagenase D (0.5 mg mI“) and DNase (0.1 mg 
ml) for 15 min at 37 °C with gentle shaking. Subsequently, samples 
were subjected to a gentleMACS dissociator (Miltenyi Biotec), followed 
by an additional 10 min incubation at 37 °C. 


Peripheral blood mononuclear cell isolation 

Peripheral blood mononuclear cells from patient blood samples were 
isolated using 50-ml Leucosep tubes (Greiner Bio-One International) 
and Ficoll-Paque PLUS (GE Healthcare). Whole blood drawn into 
sodium heparin blood collection tubes were diluted three times with 
PBS without calcium or magnesium (Lonza). Diluted cell suspensions 
were carefully layered on Leucosep tubes and centrifuged for 15 min 
at 800g at room temperature. Interphase-containing peripheral blood 
mononuclear cells were collected and washed with PBS and subse- 
quently centrifuged for 10 min at 250g at room temperature before 
further processing. 


FACS sorting 

Following tissue enzymatic dissociation of tissues, single-cell suspen- 
sions were subjected to antibody staining with anti-EPCAM, anti-CD45, 
and anti-CD3. Cells were purified by fluorescence-activated cell sorting 
(FACS) on a Becton Dickinson FACSAria Fusion cell sorter equipped 
with four lasers (405 nm, 488 nm, 561nm and 638nm). A 70-um nozzle 
running at 70 psi and 90 kHzwas usedas the setup for each sort session. 
FACSDiva (v.8.0.1) and FlowJo (v.10) were used to collect and analyse 
the flow cytometry data. Before gating on fluorescence, single cells 
were gated using forward scatter (FSC-A) and side scatter (SSC-A) (for 
intact cells) and SSC-W/SSC-H and FSC-W/FSC-H (to ensure that only 
singlets were sorted). FACS gates were drawn to include only live single 
cells based on Calcein BlueAM* and 7-AAD (Thermo Fisher Scientific). 


Further gates were drawn to arrive at CD3*CD45*EpCAM (for CD3* 
selected samples) or CD45*EpCAM cells (for CD45* selected samples). 
Boundaries between positive and negative cell fractions were drawn 
based on single-colour stains. An example gating strategy is shown 
in Supplementary Methods. 


Single-cell RNA-seq and TCR V(D)J clonotype profiling 

Sample processing for single-cell gene expression (scRNA-seq) and 
T cell receptor V(D)J clonotypes (SscTCR-seq) was done using the 
Chromium Single Cell 5’ Library and Gel Bead Kit (10x Genomics), 
following the manufacturer’s user guide. Cell density and viability of 
FACS-sorted CD3*T cells or CD45* cells from tumours, NAT and blood 
were determined by Vi-CELL XR cell counter (Beckman Coulter). All of 
the processed samples had cell viability above 90%. The cell density 
was used to impute the volume of single cell suspension needed in the 
reverse transcription master mix, aiming to achieve approximately 
6,000-10,000 cells per sample. After Gel Bead-in-Emulsion reverse 
transcription (GEM-RT) reaction and clean-up, a total of 14 cycles of 
PCR amplification were performed to obtain sufficient cDNAs used 
for both RNA-seq library generation and TCR V(D)J targeted enrich- 
ment followed by V(D)J library generation. TCR V(D)J enrichment was 
done per manufacturer’s user guide using Chromium Single Cell V(D) 
J Enrichment Kit, Human T cell (10x Genomics). Libraries for RNA-seq 
and V(D)J were prepared following the manufacturer’s user guide (10x 
Genomics), then profiled using Bioanalyzer High Sensitivity DNA kit 
(Agilent Technologies) and quantified with Qubit (Thermo Fisher Scien- 
tific) or Kapa Library Quantification Kit (Kapa Biosystems). Single-cell 
RNA-seq libraries were sequenced in one lane of HiSeq4000 (Illumina). 
Single-cell TCR V(D)J libraries were tagged with a sample barcode for 
multiplexed pooling with other libraries, sequenced in both lanes of a 
HiSeq2500 machine (Illumina) using Rapid Run mode, and then demul- 
tiplexed. All sequencing was done according to the manufacturer’s 
specification (10x Genomics). 


Processing of scTCR-seq data 

TCR-seq data for each sample was processed using Cell Ranger soft- 
ware with the command ‘cellranger vdj’ using a custom reference set 
of 30,727 genes, based on human reference genome GRCh38 and Ref- 
Seq gene models, and provided in our software package (see ‘Code 
availability’). For each sample, Cell Ranger generated an output file, 
filtered_contig_annotations.csv, containing TCR a-chain and B-chain 
CDR3 nucleotide sequences for single cells that were identified by 
barcodes. 

Although Cell Ranger software groups cells according to CDR3 nucle- 
otide sequences to obtain clonotypes separately for each sample, our 
analysis depends onidentifying shared clonotypes across samples. We 
therefore re-grouped clonotypes across the tumour, NAT and blood 
samples for each patient, requiring that they share all reported a-chain 
and B-chain CDR3 consensus nucleotide sequences in common. We 
tested our procedureon each sample individually and confirmed that 
it grouped clonotypes consistently with the original clonotypes from 
the Cell Ranger software. 

Clonotypes were assigned a tissue expansion pattern based on their 
clone sizes in NAT andtumour as follows. Clonotypes having one cell in 
NAT but none in tumour were called NAT singletons, while clonotypes 
having more than one cell in NAT but none in tumour were called NAT 
multiplets. Conversely, clonotypes having one cellin tumour but none 
in NAT were called tumour singletons, and clonotypes having more 
than one cellintumour but none in NAT werecalledtumour multiplets. 
Clonotypes having at least one cell in NAT andat least one cell in tumour 
were called dual-expanded clones. 

Clonotypes from blood samples were classified according to the 
above criteria if they had any cells in NAT or tumour. The remaining 
clonotypes, which had cells only in blood and nonein NAT or tumour, 
were called blood singletons or blood multiplets if they had one cell 
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in blood or more than one cell, respectively. To study the overlap of 
clonotypes between blood and tissue, we also classified clonotypes 
from blood samplesas either blood non-expanded or expanded onthe 
basis of their blood clone sizes being one or more than one, respectively, 
without regard to their NAT or tumour clone sizes. 


Processing of scRNA-seq data 

scRNA-seq data were processed with Cell Ranger software. Illumina 
base call (BCL) files were converted to FASTQ files with the command 
‘cellranger mkfastq’. Expression data were processed with ‘cellranger 
count’ onthe pre-built human reference set of 30,727 genes. Cell Ranger 
performed default filtering for quality control, and produced for each 
sample a barcodes.tsv, genes.tsv, and matrix.mtx file containing counts 
of transcripts for each gene in each single cell, which we parsed using 
ascriptin Rtocreate a count matrix for each sample. Single cells were 
identified by barcodes that were consistent with those from the scTCR- 
seq dataset. In total, sCRNA-seq data were obtained on 541,863,094 
transcripts in 200,626 immune cells. 


Cluster analysis of combined immune cells 

Count matrices for each sample were processed using Seurat using 
the SCTransform procedure with default parameters to perform a 
regularized negative binomial regression based on the 3,000 most 
variable genes. Normalized datasets for the tumour, NAT and blood 
(if available) samples for each patient were combined using the Find- 
IntegrationAnchors and IntegrateData functions in Seurat with the 
default value of 30 dimensions. 

The resulting datasets for the 200,626 immune cells from all 14 
patients were then combined using the FindIntegrationAnchors and 
IntegrateData functions in Seurat with the default value of 30 dimen- 
sions. This process required 130 GB of computer memory. The inte- 
grated dataset was scaled and processed under principal components 
analysis using the ScaleData and RunPCA functions in Seurat. Two- 
dimensional map coordinates were generated using the RunNUMAP 
procedure. Cluster analysis was performed using the FindNeighbours 
procedure and the FindClusters procedureata resolution of 1.6, larger 
than the default value of 0.8, in order to obtain a finer resolution of 
cell subtypes, and yielded 33 clusters of both T and non-T cells. The 
FindMarkers procedure in Seurat was run on each cluster to obtain 
biomarker genes upregulatedin that cluster. The default value of 0.25 
for logfc.threshold was found to generate fewer than 20 genes for some 
clusters, so the procedure was run with a value of 0.10, at the expense 
of greater compute time. 


Computational separation of T and non-T cells 
Clusters identified from all 200,626 immune cells were characterized 
by their mean expression of CD3E and their fraction of cells having a 
clonotype identified from the scTCR-seq assay (Extended Data Fig. 1a). 
T cell clusters were identified as having high levels of both measures, 
and non-T cell clusters were those that had low levels of both measures. 
Some clusters had intermediate values and their biological meaning 
was inferred by their biomarker genes from the FindMarkers proce- 
dure (Supplementary Table 2). Cluster 28 (Extended Data Fig. 1a, c) 
was determined to represent plasma cells based on exclusively high 
expression of MZB1 (encoding pERpI1, plasma cell-induced resident 
endoplasmic reticulum protein)***”. Cluster 17 was determined to 
represent natural killer cells, which had gene expression similar to 
effector T cells, with cytotoxic biomarkers, such as NKG7, GZMB and 
PRF1. However, the origin of cells in this cluster was heavily biased 
towards CD45-selected samples, and the fraction of TCR clonotypes 
was much lower than for other T cell clusters. The 20% of cells with TCR 
clonotypes in this cluster were thought to be from NKT cells having 
gene expression similar to that of natural killer cells. Cluster 29 was 
determined to represent macrophages expressing TCR receptors, a 
phenomenon that has been observed by others*®*”’. The identity of this 


cluster as representing macrophages was evidenced by high expression 
of SAT1, whichis increased in granulocytes and macrophages, but not 
in T,B or natural killer cells, as displayed in the Human Protein Atlas? 
at http://proteinatlas.org/ENSGO0000130066-SAT1/tissue. Clusters 
11, 16, 19, 21 and 27 with intermediate CD3E expression but clonotype 
fractions above 40% were determined to represent T cells (Extended 
Data Fig. 1a, b). 

In support of this separation process, Extended Data Fig. 1i shows 
that CD3-selected samples had relatively few cells classified as non-T 
cells, and that CD45-selected samples all contributed to the collection 
of T cells. Furthermore, subsequent cluster analyses of T and non-T 
cells separately, as described below, confirmed our separation process, 
resulting in no outlier clusters that were classified inappropriately. 


Cluster analyses of T and non-T cells 

On the basis of the above separation of T and non-T cells, the entire 
process of data transformation, integration and cluster analysis was 
repeated on the two divisions separately. The procedure was thesame, 
with two differences. First, the parameter features.to.integrate = all. 
features was given to IntegrateData, where all.features was the inter- 
section of genes from the SCTransform procedure across all patients, 
in order to maximize the genes stored by Seurat into the integrated 
assay, rather than limiting them to the genes stored as var.features. 
Second, the resolution for the FindClusters procedure was set to its 
default value of 0.8. 

One cluster of 3,182 T cells was found to have two discrete com- 
ponents in the UMAP representation, as well as a portion of 60 cells 
that clustered with a mixture of cells from other clusters (the region 
to the left 4.6b-T,,, in Fig. 2a). The latter cells were considered pos- 
sible contaminating or low quality cells and assigned no cluster. A 
division of the remaining cells was made provisionally, applying a 
k-means algorithm on the UMAP two-dimensional coordinates using 
the k-means procedure in R. Because subsequent analyses revealed 
distinct gene expression patterns for the subclusters, the division was 
therefore kept, and constituted the clusters reported as ‘8.4-Chrom’ 
and ‘8.6-KLRBI1’. 


Identification of markers 

To characterize each cluster, we applied both the FindMarkers proce- 
dure in Seurat, which identified markers using log fold changes of mean 
expression, as well as a logistic regression procedure, whichidentified 
biomarkers that separated cells of a given cluster from other cells. The 
logistic regression procedure was performed on each gene within each 
patient for each cluster using the SCTransform assay data, and yielded 
a signed deviance value, where the sign depended on the coefficient 
for the gene expression term, as well as a t-statistic. For each cluster, 
signed deviance values were combined across patients using the Rank 
Product procedure inthe RankProd package inR, and using the result- 
ing one-sided q-values for ranking genes. 


Gene set enrichment analysis 

Gene sets from MSigDB*” v.7.0 were downloaded as Entrez geneiden- 
tifiers in GMT format from http://software.broadinstitute.org/gsea/ 
downloads.jsp and parsed using the getGmt procedure from the GSEA- 
Base package in R. For each cluster from the cluster analysis of T cells 
and each of the 22,596 gene sets, we performed a one-sided z-test on 
thet-statistics from the logistic regression procedure in ‘Identification 
of markers’ section. This statistical test was previously proposed™ as an 
alternative to enrichment scores, provided that genes are given a value 
that has zero mean and unit variance, which should be satisfied theo- 
retically by t-statistics and verified by inspection of quantile-quantile 
plots. To avoid spurious results, we considered only gene sets with five 
or more genes matching those in our scRNA-seq dataset. Inspection 
of results showed high ranking gene sets from clusters ‘8.4-Mitosis’ 
and ‘8.5-Chrom’ against Gene Ontology gene sets corresponding to 


biological processes, and ‘3.1-MT’ against the gene set for mitochondrial 
genes, as shown in Extended Data Fig. 4g. 


Comparison of clusters with external single-cell gene signatures 
We computed reference gene signatures from the scRNA-seq data of 
the datasets for Guo et al.,? Zhang et al.* and Yost et al.°. The first two 
datasets had count matrices that were already normalized by library 
sizes in the form of transcripts per million (TPM) matrices, and we 
computed this same normalization for the third dataset. For each 
dataset, we considered only genes with at least one non-zero TPM value 
across all cells, and computed the log,-transformed expression value 
log,(TPM + 1), where 1 represents a pseudocount. For each cluster, 
based onthe metadata from the original analyses, we obtained robust 
centroids by performing a10% trimmed mean on log-transformed 
expression values for each gene across cells in the cluster. We used 
these centroids as reference gene signatures for the R package SingleR, 
which assigned single-cell assignments to each cell in our dataset. 
Gene expression measurements from our dataset were the cell counts 
computed for each sample before integration, again considering only 
genes with at least one non-zero TPM value across all cellsinthe sample 
and computing the log,-transformed expression value log,(TPM +1). 
We cross-tabulated counts of cells based on both our internal cluster 
assignments and the external assignments computed by SingleR, and 
normalized the resulting matrix first by the external assignments and 
then by our internal assignments to generate heat maps for Extended 
Data Fig. 4a. 


Display of gene expression 

Bar plots in Extended Data Fig. 4b are computed using the scaled data 
from the integrated assay using Seurat. Because the integrated assay 
requires that genes be selected as variable in each of the component 
datasets, some genes could not be depicted this way. In particular, the 
PD1 gene was not included in the integrated assay, so its expressionin 
Extended Data Fig. 4dis taken from counts from the original RNA data. 


Analysis of data from Guo et al.? 

Single-cell RNA-seq counts were obtained using the GEOquery library 
inRandrunning the command getGEOSuppFiles for GSE99254. Expres- 
sion data were obtained over 23,459 genes in 12,348 cells. Metadata 
consisting of the patient, cluster assignment, and tissue source for 
each cell were downloaded from the Gene Expression Omnibus (GEO) 
project GSE99254 (https://www.ncbi.nlm.nih.gov/geo/query/acc. 
cgi?acc=GSE99254). TCR sequences and clonotype groupings were 
obtained from supplementary table 2 of Guo et al.°. TCR sequences 
were obtained for 10,202 cells in 7,398 distinct clonotypes. 


Analysis of data from Zhang et al.* 

Single-cell RNA-seq counts were obtained using the GEOquery library in 
Rand running the command getGEOSuppFiles for GSE108989. Expres- 
sion data were obtained over 23,459 genes in 11,140 cells. Metadata 
consisting of the patient, cluster assignment, and tissue source for 
each cell were downloaded from GEO project GSE108989 (https://www. 
ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE108989). TCR sequences 
and clonotype groupings were obtained from supplementary table 4 
of Zhang et al.*. TCR sequences were obtained for 9,878 cells in 7,274 
distinct clonotypes. 


Signature for resident memory T cells 

Genes were identified as being positively associated (13 genes) or 
negatively associated (18 genes) with resident memory T cells, based 
on Fig. 3a froma published study of tissue-resident versus circulating 
Tcells®. The D45234E gene was identified as asynonym for NSGI1 (Entrez 
ID 27065). Genes were matched against those in the integrated assay 
from Seurat, showing an overlap with four positively associated genes 
(CRTAM, ITGAE, DUSP6 and RGS1) and three negatively associated ones 


(FAM65B, STK38 and KLF2). Although only a few genes showed overlap, 
the published study indicates that genes in this signature share highly 
correlated gene expression. The expression of each gene was converted 
to z-scores by subtracting the mean expression across samples and 
dividing by their standard deviation, and we computed a positive sig- 
nature score for each T cellas the mean z-score from the four positively 
associated genes and a negative signature score similarly from the 
three negatively associated genes. The overall score was the difference 
between the positive and negative signature scores. 


Signature for terminal versus stem-like exhaustion 

Microarray data, in the form of log,-transformed robust multichip 
average (RMA) normalized intensities, were obtained from samples 
GSM2227309 to GSM2227314 from GEO project GSE84105?’ using 
the getGEO procedure of the GEOquery package in R. Differentially 
expressed probes between the three CXCR5* mouse samples and the 
three TIM3* mouse samples were found using the ImFit, contrasts.fit, 
eBayes, and topTable procedures inthe limma package inR, at athresh- 
old log,-transformed fold change of three or more. These probes were 
assigned to genes using the annotate and mouse4302.db packages, and 
then translated into human equivalents using the homologene package 
inR. The resulting 58 CXCR5-associated genes and 42 TIM3-associated 
genes were compared against our scRNA-seq dataset using the scaled 
data array from the integrated assay computed by Seurat, yielding 12 
CXCRS-associated genes (GPR183, CCR6, CRTAM, SLFNS, TNFRSF25,JUN, 
IFI16, SLAMF6, XCL1,IL7R, EMB and SATB1) and 9 TIM3-associated genes 
(ADAM8, PRDM1, CDKN2A, CCL4, CD7, AHR, GPR5S6, GZMA and CISH) 
in common. The expression of each gene was converted to z-scores 
by subtracting the mean expression across samples and dividing by 
their standard deviation, and we computed for each T cell aCXCR5 
signature score as the mean z-score from the CXCRS5-associated genes 
anda TIM3 signature score similarly from the 7/M3-associated genes. 
The overall score for terminal exhaustion versus stem-like exhaustion 
was the difference between the TIM3 and CXCRS signature scores. 


Integration of scRNA-seq and scTCR-seq data 

One issue in integrating scRNA-seq and scTCR-seq data is that the 
clonal expansion pattern information from TCR-seq is applied to 
clones, whereas cluster assignments from RNA-seq are applied to 
individual cells, based on their transcriptional profiles. Therefore, 
a clone may contain a diverse set of cells with different clusters 
(Extended Data Fig. 4h, i). To integrate the data, we assigned tissue 
and blood expansion patterns from clones to their constituent cells. 
Clones were assigned a primary cluster, based onthe cluster with the 
largest representation of cells in the clone. In cases of ties, in which 
the two largest representative clusters had equal counts, we assigned 
no primary cluster to that clone. 

Another consideration is that it was possible for cells to be assigned 
to the non-T cell category based on the scRNA-seq assay, but given a 
clonotype based on the scTCR-seq assay. For analyses of scTCR-seq 
data alone, such cells would be included on the basis of their clonotype. 
Foranalyses of scRNA-seq data alone, such cells would be excluded for 
lacking aT cell cluster. However, for integrated analyses of scRNA-seq 
and scTCR-seq data, the answer is unclear. As indicated in Extended 
Data Fig. 1a, the assignment of clonotypes to non-T cells was relatively 
uncommon in our dataset. In this study, we included such cells ininte- 
grated analyses with a special cluster of ‘non-T’, computed primary 
clusters as described above, and then assigned no primary T cell clus- 
ter to clones that had ties for the two largest representative clusters 
(with one potentially being non-T) or had non-T as the primary cluster. 
Cells were then excluded from the integrated analysis if their parent 
clone lacked a primary T cell cluster. The result of this procedure was 
toinclude non-T cells with a clonotype at the outset, but then exclude 
most of them (including all singletons) because their primary cluster 
was non-T cells. 
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Analysis of data from Yost et al.° 

scRNA-seq counts and single-cell TCR sequences were obtained using 
the GEOquery library in R and running the command getGEOSupp- 
Files for GSE123813. Metadata consisting of the patient, treatment 
status, and cluster assignment for each cell were obtained from the 
corresponding authors for Yost et al.°. We analysed expression data 
that had matching metadata, resulting in 33,106 cells measured on 
23,309 genes. Single-cell TCR sequences were available on 28,371 
cells with matching metadata. Bulk TCR-seq sequences and template 
counts were obtained from the ImmuneACCESS Web portal by Adap- 
tive Biotechnologies (https://clients.adaptivebiotech.com/pub/yost- 
2019-natmed). 

The scTCR-seq data were provided as TCR-a and TCR-f nucleotide 
andamino acid sequences without assigned clonotypes. We therefore 
created clonotype groupings from pre- and post-treatment tumour 
samples by matching all available TCR chains between cells of a given 
patient, using a procedure analogous to our re-grouping of tumour, 
NAT and blood clonotypes for our own dataset. Among the 11 patients 
in the dataset, this procedure identified 17,369 distinct clonotypes. 
Patient bcc.su003 had no clonotype matches between pre-treatment 
and post-treatment tumour samples, and could not be analysed further. 
The remaining patients had pre-treatment and post-treatment bulk 
TCR-seq data from blood for five and six patients, respectively, with 
patient bcc.su002 having no pre-treatment blood sample. 

Bulk TCR-seq clonotypes were linked to tumour clonotypes by match- 
ing the single TCR B-chain to any of the TCR B-chains from a clonotype 
in the scTCR-seq data (Extended Data Fig. 7a). The bulk TCR-seq data 
reports an 87-bp rearrangement sequence from each TCR B-chain that 
includes the CDR3 sequence, so the scRNA-seq CDR3 (sc-CDR3-nt) was 
required to match exactly at the nucleotide level to the corresponding 
substring (bulk-CDR3-nt) in the bulk TCR-seq rearrangement. This 
alignment process was facilitated by the fact that the immunoSEQ 
output includesthe amino acid CDR3 (bulk-CDR3-aa) sequence, which 
we used to identify the corresponding bulk-CDR3-nt. We could then 
create a hash table of all sc-CDR3-nt sequences and use that for lookup 
froma given bulk-CDR3-nt sequence. 

Because scTCR-seq clonotypes can contain TCR a-chains as well 
as multiple TCR B-chains, the potential exists for a many-to-one cor- 
respondence between ascTCR-seq clonotype from tumour anda bulk 
TCR-seq clonotype from blood. For analyses requiring clones to be 
classified by the presence or absence of a bulk TCR-seq match (thatis, 
blood association versus blood independence) (Fig. 3c), all matching 
clonotypes were considered to have an association with blood. For 
analyses requiring matching clone sizes between tumour and blood 
(Fig. 3a, Extended Data Fig. 7b, c), multiple tumour clonotype matches 
for agiven blood clonotype were combined by summing the scTCR-seq 
clone sizes and resolving multiple primary cell clusters by a majority 
vote or leaving the primary cell cluster unassigned in case of a tie. For 
the analysis comparing pre- and post-treatment bulk TCR sequences 
(Extended Data Fig. 7d), clonotypes were matched when the entire 
87-bp rearrangement sequences were identical. 

Because bulk TCR-seq measures transcripts rather than cells, we 
could not discriminate blood-expanded from non-expanded clones, 
although we could distinguish between blood-independent and blood- 
associated clones by the absence or presence of amatching transcript. 
We assumed that if the number of transcripts per cell is relatively 
uniform, the number of transcripts was approximately proportional 
to the number of cells. Therefore, instead of measuring peripheral 
clonal expansion, we measured blood clonal diversity using a Shan- 
non entropy measure, thesum of p x log,(p), over all transcripts with 
acount of50 or more. This threshold was implemented because of the 
heavily skewed distribution of transcript counts, with large numbers 
of singleton transcripts that potentially introduced noise into the 
calculations. 


Completeness curves for Yost et al.° 

We used the transcripts field from each immunoSEQ analysis file for 
bulk TCR-seq from a pre- or post-treatment blood sample as input to 
the iNEXT procedure in the iNEXT package in R, with ‘abundance’ as 
the datatype parameter. The resulting interpolated, observed, and 
extrapolated values were then plotted using the mand SC variables in 
the iNextEst object for each sample (Extended Data Fig. 7e). 


Comparison of TCR sequences with VDJdb 

We obtained the June 2018 (2018-06-04) release of VDJdb™ (https:// 
github.com/antigenomics/vdjdb-db/releases). We analysed the file 
vdjdb.slim.txt to extract entries where the TCR species was ‘Homo- 
Sapiens’ and the epitope species was ‘CMV, ‘EBV’ or ‘InfluenzaA’. We 
compared each clonotype from our data against the cdr3 field in the 
VDJdb database, requiring an exact match from any of the CDR3 amino 
acid sequences for the clonotype. 


Comparison of TCR sequences with TCGA 

We obtained a set of 716,720 TCR sequences assembled from 9,142 bulk 
RNA-seq samples in TCGA using the TRUST algorithm®. We obtained 
those sequences from the authors after gaining authorized access to 
sequence-level data in TCGA. Each sequence was labelled with a puta- 
tive CDR3 amino acid sequence in the header of the FASTA file. We 
compared each clonotype from our data, requiring an exact match 
fromany of the CDR3 amino acid sequences for the clonotype against 
any substring of the TCGA-derived CDR3 sequence. Each FASTA file was 
annotated with an aliquot ID, which we linked to a cancer type using 
theR library GGnomicDataCommons. We considered a TCGA-derived 
CDR3 sequence to be virally reactive when it was observed in three or 
more individuals, all with different cancer types, and no two individuals 
had the same cancer type. 


Gene signatures for tissue expansion patterns 

We applied a logistic regression procedure similar to that used for 
finding characteristic biomarkers of T cell clusters, except that we 
classified cells in tumour tissue on the basis of their parent clones 
beingtumour singletons, tumour multiplets, or dual-expanded clones. 
The logistic regression procedure was performed on each gene within 
each patient for each tissue expansion pattern using the scRNA-seq 
assay data to test all available genes. The library size was included asa 
covariate to help remove artefacts related to sequencing depth. Each 
logistic regression fit yielded a t-statistic for a given gene ina given 
tissue expansion pattern. For each tissue expansion pattern, t-statistics 
were combined across patients using the Rank Product procedure in 
the RankProd package inR, and using the resulting one-sided g-values 
for ranking genes. 

A total of 30 genes were selected for each gene expression signature. 
Because gene signatures are intended for use in bulk RNA-seq data from 
tumours, which containnon-T immune cells and non-immune cells, we 
considered the problem that our genes were selected within a restricted 
universe of T cells and that expression of our genes by non-T cells could 
introduce noise. We therefore examined the co-expression patterns of 
our gene signatures in bulk RNA-seq data from pre-treatment tumours, 
obtained from three clinical trials, as discussed below (Extended Data 
Fig. 9a). We computed the Pearson’s correlation coefficient of z-scores 
for each pair of genes in each signature across patients in all three tri- 
als, and performed hierarchical clustering of the correlation matrix. 
We cut the hierarchical clustering at the first level and used the larger 
of the two groups for the gene signatures. To compute a composite 
score, we used the ¢-statistics from the logistic regression signatures 
as weights. However, the relatively high degree of correlation of gene 
expression meant that weighting had relatively minor effects, and 
we obtained similar results by giving equal weighting to genes in our 
gene signatures. 


Analysis of clinical trial data 

Bulk tumour RNA-seq data were analysed from three clinical trials: 
(1) IMvigor210 (National Clinical Trials identifiers NCTO2951767 and 
NCT02108652)'8, a single-arm trial studying locally advanced or meta- 
static urothelial carcinoma and having 354 patients with bulk RNA- 
seq data; (2) POPLAR (NCT01903993)”, a dual-arm trial comparing 93 
patients with bulk RNA-seq data from non-small-cell lung carcinoma 
treated with atezolizumab and 100 patients treated with the chemo- 
therapeutic agent docetaxel; and (3) IMmotion150 (NCT01984242)”, 
a triple-arm trial of patients with renal cell carcinoma, treated with 
atezolizumab (86 patients), atezolizumab plus the VEGF inhibitor 
bevacizumab (88 patients), or the tyrosine kinase inhibitor sunitinib 
(89 patients). RNA-seq counts were normalized by a trimmed mean of 
M-values, log,-transformed, and converted to z-scores for each gene by 
subtracting the means and dividing by the standard deviation across 
all patients in the trial. Six patients in IMvigor210 had missing values 
for PFS, and could not be included in survival analysis. Subsequent 
clinical analysis was performed using the coxph function from the 
survival package inR. 

PFS hazard ratios were computed for each gene by dichotomizing 
the expression of that gene based on the value being higher or lower 
than the median, and then fitting a Cox proportional-hazards model 
to the censored PFS survival times using the dichotomized variable. 
To evaluate the overall set of hazard ratios for each gene signature, we 
applied the gene set test proposed previously”. Specifically, we first 
confirmed from quantile plots that log-transformed hazard ratios 
were normally distributed. We then found an intersection of genes 
from each signature in a given clinical trial by matching the ENTREZ 
gene IDs. The overall hazard ratio was computed as their geometric 
mean, equivalent to an arithmetic mean of the log-transformed hazard 
ratios. Az-statistic was computed as the mean of the log-transformed 
hazard ratios multiplied by the square root of the number of intersect- 
ing genes, and the Pvalue was obtained by referring this value to the 
normal distribution. 

To evaluate combined signature scores, we found an intersection of 
genes from each signature ina given clinical trial as before. The resulting 
common genes in each signature were used to calculate an expression 
score for each patient in each trial as a weighted mean z-score, using 
mean t-statistics from the logistic regression models to weight each 
gene. The signature score was dichotomized in each clinical trial based 
onits value being higher or lower than the median. A Cox proportional- 
hazards model of this dichotomized variable was fitted to the censored 
PFS survival datain eacharm of each clinical trial, and reported as the 
hazard ratio and P value of the Wald statistic for each Kaplan-Meier 
survival plot. 

To compute interactions between each signature score and CD8A 
expression, each score was dichotomized in each clinical trial based 
on its value being higher or lower than the median. Patients were then 
classified into one of four groups, depending on the combination of 
these dichotomized variables. Survival analysis was performed on 
these groups, with the hazard ratio and P values computed using the 
dual-low group as the control. Interactions between the tumour mul- 
tiplet and dual-expanded signatures were analysed analogously using 
dichotomized values for these signature scores. 


Statistical analyses 

Aside from the clinical analyses described above, the following statisti- 
cal tests were used in this study. (1) Chi-square tests were performed 
using the chisq.test function in R. (2) Post hoc analyses of cells ina con- 
tingency table for over-representation were performed as previously 
described* in which a 2 2 table is created from the counts ina given 
cell and the sum of remaining cells in the same row and column. For 
each cell, a Fisher exact test was applied using the fisher.test function 
in R with alt = “greater” to test the null hypothesis that the observed/ 


expected ratio < 1, in which expected counts derive from the product 
of the row andcolumn marginals. (3) Pearson’s correlation coefficients 
and associated two-sided P values were computed using the cor.test 
function in R to test the null hypothesis that the correlation coefficient 
is zero. (4) Tests of linear regression t-tests were performed using the 
Im functioninR, followed by the summary function to obtain the two- 
sided P value of the null hypothesis that the regression coefficient is 
zero. Insome cases, transformations of the variables were performed 
in addition to linear models, to test the null hypothesis that no rela- 
tionship exists between the variables. (5) t-tests were performed on 
log-transformed clone sizes using the t.test function in R to obtain 
the two-sided P value of the null hypothesis that the two groups have 
equal means. (6) One-samplez-tests were performed using the pnorm 
function in R to test the null hypothesis that values in a group have 
mean < 0 (for the GSEA analysis in Extended Data Fig. 4g, with lower. 
tail set to FALSE) or mean = 0 (for the hazard ratio analysis in Fig. 3a). 
The application of this test for GSEA was previously described**. The 
value provided to pnorm was the mean of the values times the square 
root of the number of cases. For the GSEA analysis, in which biological 
insight was the main goal, no consideration was made for dependency 
among genes. For the hazard ratio analysis, we computed the correla- 
tion matrix R across the 30 genes based on the expression values across 
the patients in each clinical trial. The effective number of cases was 
given by the square root of 1'R1, where ‘I’ indicates a vector of ones. This 
procedure reduced the effective value of n by 20-25%. (7) Comparisons 
of correlation coefficients was performed using a Fisher’s z-test using 
the cocor function from the cocor package in R to test the null hypoth- 
esis that two correlation coefficients from separate samples are equal. 

Nostatistical methods were used to predetermine sample size. The 
experiments were not randomized unless otherwise stated. All fig- 
ures of UMAP plots randomize the order in which cells are plotted. 
Investigators were not blinded to allocation during experiments and 
outcome assessment. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| Separation of T and non-T cells. a, Criteria for 
separation. Clusters from the overall cluster analysis of allimmune cells 
assayed by scRNA-seq from the 14 patients in this study. Clusters are plotted 
according to the fraction of cells with a TCR clonotype and the mean 
expression of the T cell marker CD3E from scRNA-seq. Clusters with high values 
on both metrics were considered to represent T cells, whereas those with low 
values on bothrepresent non-T cells. Intermediate values warranted further 
consideration and are annotated with the assigned cluster label for further 
reference. The eventual division of clustersinto T and non-T cells is 
represented by the colours red and blue, respectively. b, Isolation of T cells. 
Immune cells are plotted as dots, positioned by the UMAP dimensionality 
reduction of gene expression. Cells from non-T clusters (blue ina) are in black, 
and cells from T clusters (red in a) are coloured by their subsequent cluster 
shownind. Numbers inbrackets indicate T cell clusters annotated ina.c, 
Isolation of non-T cells. The UMAP plot of b for immune cells is coloured for 


cells from non-T clusters based on their subsequent cluster, as shownineandg. 


d, Origin of T cells. Heat map shows the cross-labelling of T cells by their 
original cluster assignment in the combined analysis of a (rows) and their 
subsequent cluster analysis of T cells separately (columns). Intensities indicate 
the normalized frequency by column. The subsequent cluster is assigneda 
distinct colour, shown in the row labelled ‘new’, matching the schema in Fig. 2a. 
e, Origin of non-T cells. Cross-labelling as ind but for non-T cells. Colours inthe 


row labelled ‘new’ match the schemain h. f, Mixing of T cells across patients. 

T cells are mapped by the UMAP algorithm in a subsequent analysis of the T cell 
division, and are coloured by the patient of origin. Patients are observed to be 
well mixed across the map, indicating adequate integration of the individual 
samples. g, Clonotype fractions for T cell clusters. Bar plot shows the fraction 
of cells witha TCRclonotype for each T cell cluster, coloured according to the 
schemaind.h, UMAP plot of non-T cells. Non-T cells from CD45-selected 
samples are mapped by the UMAP algorithm ina subsequent analysis of the 
non-T cell division, and are coloured by their cluster assigned by cluster 
analysis, using the schema ine. i, Sample statistics. Statistics are provided for 
the cells in our dataset after separationinto T cell and non-T cell categories. 
Patients are labelled by their cancer type: non-small-cell lung adenocarcinoma 
(lung), endometrial adenocarcinoma (endo), colorectaladenocarcinoma 
(colon), and renal clear cell carcinoma (renal). Each patient is annotated by 
whether the cells were selected by CD3 or CD45, and statistics are given 
separately for tumour (T), normal adjacent tissue (N), and peripheral blood (B) 
samples. Numbers indicate the total counts of transcripts and cells from 
scRNA-seq, as well as the count of cells with clonotypes from scTCR-seq in the 
column labelled ‘typed’. Cells were grouped into distinct clonotypes, and the 
count of distinct clonotypes is shown for each patient in the column labelled 
‘clones’. 
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Extended Data Fig. 2 | Clonotype sharing and tissue expansion patterns. a, 
TCR sharing across compartments. Venn diagram for each patient shows 
sharing of TCRclonotypes across compartments. Values indicate the numbers 
of distinct clonotypes unique to each compartment or shared among 
compartments inthe overlapping oval regions. b, Distribution of clones by 
tissue expansion patterns. Bar plot for each patient shows the fraction of 
clones having each tissue expansion pattern. c, Distribution of cells by tissue 
expansion pattern. Bar plots for each patient show the distribution of cellsin 
the NAT (left) and tumour (right) compartments according to the tissue 
expansion pattern of their parent clone. d, Clonal expansion in tissue. Scatter 
plot for each patient shows each distinct clonotypeasa dot, with coordinates 
indicating normalized clone size, or cell fraction, inthe NAT and tumour 
compartments. Dots are coloured by atwo-dimensional palette in the bottom 
right, in which blue shades intensify with increasing NAT clone size, pink 
shades intensify with increasing tumour clonesize, and purple shades intensify 
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withincreasing clone sizes in both compartments. NAT and tumour singletons 
are indicated by yellow and orange, respectively. Vertical and horizontal grey 
lines in eachscatter plot indicate divisions between absence (clone size of 0 
cells) and presence (clone size of lor more cells). Diagonal grey linesindicate 
equal cell fractionsin the two compartments. Numerical values ineach title 
indicate the count of distinct clonotypes in each patient. Two-sided Pvalues are 
froma Pearson’s correlation coefficient ron log-transformed clone sizes from 
the dual-expanded clones (n,). NA indicates that statistics could not be 
computed for two or fewer clones. Patients are ordered by decreasing values of 
r.e, TCRsharing by T cells across compartments. Each patient inais 
represented bya Venn diagram, following the schemain Fig. 1b. Numbers 
within the Venn diagram regions represent counts of T cells by the tissue and 
blood expansion patterns of their parent clone. Numbers to the right of each 
diagram indicate the total number of cells from blood non-expanded and 
expanded clones, used for computing peripheral clonal expansion in Fig. 1c. 
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Extended Data Fig. 3 | Peripheral clonal expansionand tissue infiltrationin 
external dataset. The 14 patients with non-small cell lung adenocarcinoma in 
the dataset from Guo et al.* are each depicted by aset of plots, as in Fig. 1a, c-e. 
Scatter plots of distinct clonotypes areshown, plotted by cell fractions in NAT 
and tumour, with random jitter added to distinguish points. Clone sizein blood 
is indicated by dot size, and clones are coloured by the two-dimensional palette 
for tissue expansion pattern. Vertical and horizontal lines separate the absence 
and presence of clones within compartments. Diagonal lines indicate equal cell 
fractions intumour and NAT. Numerical values denote the extent of parallel 
dual expansion, measured by a Pearson’s correlation coefficient, weighted (r,,) 
by (1+ blood clone size), on the dual-expanded clones (np). Underneath the 
scatter plots, bar plots for the corresponding patient show the extent of 
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peripheral clonal expansion (top), used to order patients, as well as infiltration 
into tissue expansion patterns by blood-independent, non-expanded and 
expanded clones (middle). Pvalues are shown froma chi-square test on counts 
of cells from tumour or NAT (tissue-resident). Additional bar plots (bottom) 
show the fractions of tissue-resident cells with clonotypes observed ina blood- 
expanded clone for each tissue expansion pattern. Two patients (single 
asterisk) had no cells collected from NAT in the original dataset. In addition, 
patients PO616P and PO616A (double asterisks) each had only a single dual- 
expanded clone, soa correlation coefficient could not be computed. The 
remaining ten patients are summarized in Fig. lito show the relationship 
between peripheral clonal expansion and parallel dual expansion in tissue. 
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Extended Data Fig. 4| See next page for caption. 
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Extended Data Fig. 4| T cell subsetsand clonal composition. a, 
Characterizing clusters of T cell clusters with reference gene signatures. Heat 
maps showcross-labelling of T cell clusters (columns) to reference gene 
signatures (rows), taken from the analyses in Guoetal.’, Zhang et al.‘ and Yost 
et al.°, with intensities indicating normalized frequency. CD8 and CD4 clusters 
from Guoetal.’and Zhang et al.‘ are separated by an extra space to aid 
visualization. b, Expression of selected genes. Box plots show distributions of 
gene expression onallT cells in the dataset, with cells grouped by their clusters, 
coloured as in Fig. 2a. Tops and bottoms of boxes indicate interquartile ranges, 
and lines within boxes indicate medians. Whiskers extend an additional 

1.5x the interquartile range from the median. c-f, Characterization of T cell 
clusters. Bar plots show mean values across clusters of various measures on 
Tcells. ‘PD1expr’ denotes expression of PDI (d);‘Termex’ denotes a published 
signature of terminal versus stem-like exhaustion’ (e);‘Trmsig’ denotesa 
published signature of T,,, cells® (c); and ‘Tumour pct’ denotes the fraction of 
cells sampled from tumour versus NAT (f). Horizontal linesind and findicate 
mean values over all cells. g, Gene set enrichment analysis for selected clusters 
and gene sets. The expression of selected gene sets (columns) is shown for 
clusters (rows) by plotting each gene in the gene set that was assayed in the 
integrated dataset as a dot according toits t-statistic froma logistic regression 


analysis to identify biomarkers for each cluster. Gene Ontology genesets 
shown are: histone demethylase activity (HDM); histone methyltransferase 
activity (HMT); mitotic cell cycle (mitosis); and mitochondrial chromosome 
genes (MT). A predominance of dots to the right of the vertical line (t= 0) 
indicates overexpression of the gene set relative to the expected zero mean. 
Statistically significant cases of overexpression are shown in red withthe 
associated genes when a one-sided P< 0.001 froma one-sample z-test on the ¢- 
statistics. h, Transcriptional heterogeneity of T cell clones. Each pie chart 
represents one of the 20 largest clonotypes inthis study, as measured by total 
clone size across tumour, NAT and blood. Each clone represents aset of cells, 
indicating its total clone size, used to order clonotypes. The area of each pie is 
proportional to the clone size. Regions of each pie chart indicate the fractions 
of cells in the given clone assigned to each cluster. i, Composition of clones by 
T cell cluster and compartment. Heat map shows the unit-normalized cellular 
composition of 770 clones with atumour + NAT clone size =10 (columns) across 
Tcell clusters and tumour or NAT compartment (rows). Clones areintegrated 
from all patients and grouped by their primary cluster. Within each primary 
cluster, clones are ordered to showa gradation of cell fraction from tumour to 
NAT. Each clone is further characterized by its clone size (top) and tissue 
expansion pattern (coloured bars above the heat map). 
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Extended Data Fig. 5 | See next page for caption. 
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Extended Data Fig. 5| T cell subsets and clonal expansion behaviour. a, 
Tissue expansion patterns of clonotypes by T cell cluster. Bar plot shows the 
distribution of tissue-associated clones—having at least one cellin tumour or 
NAT-—and a primary T cell cluster assigned, grouped by primary cluster. Clones 
ineach primary cluster are further divided by their tissue residency pattern. b, 
Tissue expansion patterns of cells by T cell cluster. Bar plots show distributions 
of T cellsin NAT and tumour compartments, grouped by their assigned cluster. 
The counts in each row, corresponding toa cluster ina, comprise all tissue- 
resident cells—from tumour or NAT—assigned to that cluster. Cell counts are 
further distinguished by the tissue expansion pattern of their parent clone, 
with dual expansion shown onthe right pair of bar plots, and singletons and 
multiplets shown on the left. Pvalueis from a chi-square test on counts of 
tissue-resident T cells. Asterisks indicate statistically significant over- 
representation of the given T cell cluster and tissue expansion pattern, witha 


one-sided Pvalue froma post hoc Fisher exact test onthe same counts of tissue- 


resident T cells as the chi-square test, shown when a Bonferroni-adjusted 
P<0.01.c, Clonal expansion patterns for T cell clusters. Scatter plot for each 

T cell cluster shows tissue-associated clones with the corresponding primary 
cluster, integrated from all 14 patients in this study and plotted by their clone 
sizesin NAT andtumour on logarithmic scales. Dots are coloured by their tissue 
expansion pattern, as per the two-dimensional palette, except that blood 
singleton and multiplet clones were not plotted because only four patients had 
blood samples. d,e, Analysis of external datasets. Thesame methodology ofc 
was applied to datasets from Guo et al.?on14 patients with non-small cell lung 
carcinoma (c) and Zhang et al.* on 12 patients with colorectal adenocarcinoma 
(d). Clones were grouped according to their primary cluster from the original 
analyses, and coloured by the two-dimensional palette for tissue expansion 
pattern at the bottom right of e. Blood clone sizes are indicated by dot size, as 
ine. 
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Extended Data Fig. 6| See next page for caption. 


Extended Data Fig. 6 | Tissue and blood expansion patterns and T cell 
clusters. a, Clonal expansion scatter plots by patient. Data from Fig. laare 
shown, except clones are coloured by their primary cluster (see legend). b, 
Clonal expansion by cluster and tissue and blood expansion pattern. The15 
scatter plots in Fig. 2c are represented as vertical sets of strip charts, witheach 
chart showing theclone sizes in tumour plus NAT for clones in each tissue 
expansion pattern in thescatter plot (abbreviated asn,N,D,T andt, asin 

Fig. 1b). Strip charts are organized by primary cluster and blood expansion 
pattern: blood-independent, blood non-expanded and blood-expanded. These 
one-dimensional representations facilitate the comparison of clone sizes and 
depiction of statistical results. Pvalues are shown froma chi-square test of 
counts of clones. For each strip chart, the observed/expected ratio and one- 
sided P values are shown in red when a Bonferroni-adjusted P< 0.01 from post 
hoc Fisher exact tests of the same counts of clones as the chi-square test. 
Additional statistical tests were performed to compare mean clone sizes 
between the blood-independent, blood non-expanded and blood-expanded 
categories for each tissue expansion pattern in each cluster. Only two tests had 
Bonferroni-adjusted P< 0.01, shownas bars in the 8.1-T.,; dual-expanded 
category, with two-sided Pvalues from at-test on log-transformed clone sizes. 
c, Blood expansion patterns by T cell cluster. Bar plots show the numbers of 
clones in each of the four patients with a blood sample, with clones grouped by 


their primary cluster and further divided by their blood clone size as being 
blood-independent, blood non-expanded or blood-expanded. d, Blood- 
associated expansion by T cell cluster. As inc, except blood-independent 
clones are excluded, and only blood-associated clones are tabulated.e, 
Distribution of T cells in blood by blood expansion pattern. Bar plots show 
numbers of T cells found in blood, grouped according to their cluster and 
further divided by the blood expansion or non-expansion pattern of their 
parent clone. Because parent clones are guaranteed to have the givencellin 
blood, blood independence is not possible. f, Two-dimensional map of cells in 
blood by peripheral clonal expansion. Cells from blood witha clonotype are 
plotted onto the UMAP coordinates from Fig. 2aand coloured greenifnon- 
expanded in blood (blood clone size =1) or ashade of brown for increasing 
expansion in blood. Ovals from Fig. 2a are added for reference. g,i, Distribution 
of T cell clusters in tumours (g) and NAT (i) by blood expansion pattern. As ine, 
except for cells in tumour (g) and NAT (i) from the four patients with blood 
samples. Pvalues are froma chi-square test of counts of cells over T cell clusters 
in blood versus counts over T cell clusters in tumour (g) or NAT (i). h,j, Two- 
dimensional maps of cells intumour (h) and NAT (j) by blood expansion 
pattern. As inf, except for cells intumour (h) and NAT (j) from the four patients 
with blood samples. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | Sharing of novel clones intumour with clonotypes in 
blood. a, Matching bulk TCR-seq and scTCR-seq clonotypes. An example from 
the Yost et al.° dataset is shown to illustrate issues in matching clonotypes 
across bulk and single-cell technologies. Bulk TCR-seq from Adaptive 
Biotechnologies immunoSEQ technology yields single 87-base-pair segments 
of individual B-chains, whereas scTCR-seq from 10x Genomics potentially 
yields combinations of a- and B-chain CDR3 sequences per clonotype, 
indicated here by four clonotypeIDs and associated sequences in grey boxes. 
The immunoSEQ output also provides a CDR3 amino acid sequence (bulk- 
CDR3-aa, rectangle) for productive B-chains, which we used to facilitate 
matching. We considered clonotypes to matchif either B-chain CDR3 from 
scRNA-seq aligned exactly to the bulk TCR-seq sequence at the nucleotide 
level, at the position consistent with bulk-CDR3-aa. a-chain CDR3 sequences 
were disregarded in this process. All four clonotypes shown were therefore 
considered matches to the bulk TCR-seq sequence. For the purpose of counting 
Tcells,a sum was taken over all matching scRNA-seq clonotypes. Further 
considerations are provided in Methods. b, Correlation of tumour and blood 
clone sizes in novel CD8 clones. Scatter plots are shown for each patient in Yost 
et al.° that had both single-cell RNA-seq and TCR-seq of tumour-infiltrating 
lymphocytes in pre-and post-treatment tumours as well as bulk TCR-seq of 
Tcells in blood. Dots represent novel CD8 clones based on the primary post- 
treatment cluster from the original analysis. Novel clones are plotted by the 
count of transcripts in pre-treatment blood (resorting to post-treatment blood 
for bcc.su002, which lacked a pre-treatment blood sample), used as a proxy for 
blood clone size, and clone size in post-treatment tumour. Vertical bar 


separates novel clones matching a clonotype in blood (blood-associated, right) 
from those that did not (blood-independent, left). Two-sided Pvalues are 
shown fora Pearson’s correlation coefficient ron blood-associated novel 
clones. Patients are ordered by their total (blood-associated plus blood- 
independent) number of novel CD8 clones. Two-sided Pvalues areshown from 
a Fisher’s z-test for the comparison of the correlation coefficient of CD8 novel 
clones and the correlation coefficient of the CD4 novelclones.c, Correlation of 
tumour and blood clone sizesin novel CD4 clones. Scatter plots are shown for 
the novel CD4 clones from patients in a, in corresponding order, as inb.d, 
Clonal diversity in blood. Scatter plots are shown for the patientsinbandc, in 
corresponding order. Dots represent distinct TCR B-chain rearrangements as 
provided in the original immunoSEQ analysis, plotted by the numbers of 
templates reported in pre-and post-treatment blood. For patient bcc.su002, 
whichlacked a pre-treatment blood sample, a one-dimensional strip chart 
shows the post-treatment TCR repertoire with horizontaljitter added to 
display points more clearly. Increasing clonal diversity can be observed 
qualitatively as the increasing presence of clones along the main diagonal, and 
is quantified using Shannon entropy. e, Completeness curves for blood TCR- 
seq samples. Each plot shows asample completeness curve for a bulk TCR-seq 
sample in d based ona rarefaction and extrapolation analysis*®, with pre- and 
post-treatment samples coloured as shown. Each curve indicates the estimated 
coverage of the total set of TCR B-chain rearrangements asa function of the 
total number of transcripts sampled. Dot indicates the actual number of 
transcripts sampled, solid lines indicate the interpolated completeness curve, 
and dashed lines indicate an extrapolation of the completeness curve. 
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Extended Data Fig. 8 | Matching clonotypes against databases of knownand 


putative virally reactive TCRs. a, TCRrepertoire of clonotypes matching 


VDJdb. Aset of rug plots is shown for each patient, with each plot representing 
the repertoire of clonotypes matching TCRs listed as reacting against common 


viral antigens from the VDJdb database™. Viral antigens shown are from 
cytomegalovirus (p65 antigen), Epstein-Barr virus (BMLF1, EBNA3) and 
influenza A (M1). Other antigens from these viruses are listed as ‘other’. Each 
rug plot depicts each distinct clonotypeasa region, coloured by its primary 
cluster, with the height of each region indicating its total clone size intumour 
plus NAT. Clonotypes are stacked on top of one another in random order. 

In situations in which adjacent clones share the same colour, black lines were 
used to separate them, when both clones hadaclone count greater than 5, 
indicating a need to resolve them visually. Plots show that most matching 
clonotypes were singletons, but that patients often had afew virally reactive 
clonotypes that had expanded greatly. b, Association of viral reactivity with 
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Extended Data Fig. 9 | Gene signatures of tissue expansion patterns and 
relationship to CDSA expression. a, Consistency of gene signatures in bulk 
tumour RNA-seq. The 30 highest-ranking genes for each tissue expression 
pattern involving a tumour sample are shown ina heat map of correlated gene 
expression from bulk tumour RNA-seq data fromall patients in the three 
clinical trials analysed in this study. Intensities of each cell represent the 
Pearson’s correlation coefficient on the gene expression values from patients 
between each pair of genes, and dendrograms indicate the hierarchical 
clustering of genes. The first division of each dendrogram is used to eliminate 
genes that areinconsistent with other genes in the signature, possibly due to 
expression by non-T cells. b, Expansion signatures in scRNA-seq data. Heat map 
shows the relative gene expression of signatures fromainthe scRNA-seq data 
of this study, used for the initial selection of gene signatures. Intensities 
indicate the meanlog,-transformed fold change of each gene (rows) across 
patients (columns), in which the fold change was computed within each patient 
for the T cells from the tumour compartment ofa given cluster against all other 
Tcells from the tumour compartment from that patient. ‘Consistent’ indicates 
genes that passed the filtering step ina by black cells, which are used for 
subsequent analysis. Genes commonto more than one signature are marked by 


anasterisk. c, Correlation of expansion signatures with CD8A expression. 
Scatter plots show the correlation of CD8A expression with expansion 
signature scores across patient bulk RNA-seq samples from three clinical trials. 
Expression of CCLSis also included asa marker of expansion, ranking highly in 
boththe tumour multiplet and dual expansion signatures from the scRNA-seq 
analysis. Each dot represents a pre-treatment bulk tumour RNA-seq sample, 
coloured by the clinical trial.d, Survival analysis of CD8A expression. Kaplan- 
Meier plots of PFS are shown for each arm ina clinical trial, with patients 
dichotomized by their expression of the CDSA gene in bulk tumour above 
(CD8A"®") or below (CD8A’) the median expression amongall patients in the 
corresponding clinical trial. CD8A expressionis used as a marker for the 
prevalence of intratumoural CD8' T cells, which is known to bea predictor of 
response to cancer immunotherapy. Censored observations are indicated bya 
plus symbol. Hazard ratios (HR) and two-sided P values froma Cox 
proportional-hazards model on patients in both groups are shown, highlighted 
in red with the associated survival curve when P< 0.05. Six patients in 
IMvigor210 were omitted owing to missing values for PFS.e, Asind, except for 
gene expression of CCLS. 
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Extended Data Fig. 10 | Survival analysis. a, Survival based on clonal 
expansion patterns. Kaplan-Meier survival curves for PFS are shown for three 
clonal expansion signatures (rows) ineacharm of aclinical trial (columns), in 
which patients are dichotomized by scores above (dotted lines) and below 
(solid lines) the median in each clinical trial. Plus symbols indicate censoring 
events. Hazard ratios (HR) and two-sided Pvalues from a Cox proportional- 
hazards model on patients in both groupsare shown, highlighted in red with 
the corresponding survival curve when P< 0.05. b, Survival based on expansion 
signatures inthe context of CD8A expression. Kaplan-Meier plots for PFS are 
shown using both CD8A expression and expansion signature scores (rows) in 
eacharm of aclinical trial (columns). Patients in each clinical trial were divided 
into four groups based on CDSA expression above (CD8A"®*") or below (CD8A'™) 


the median and onexpansion signature score above (signature"®") or below 
(signature'™) the median amongall patients in the corresponding clinical trial. 
Patients with low CD8A expression and low expansion signature score were 
usedasacontrol foreach of the other three groups. Hazard ratios and two- 
sided Pvalues are froma Cox proportional-hazards model on patientsin each 
group, highlighted in red with the corresponding survival curve when P< 0.05. 
c, Survival based on dual-expanded and tumour multiplet signatures. Kaplan— 
Meier survival curves are plotted asin b, except that patients were divided into 
four groups based on dual-expanded clone signature above (D"®") or below 
(D'’) the median and on tumour multiplet signature above (T"®") or below (T'™) 
the median amongall patients in the corresponding clinical trial. 
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Data collection Data were collected using Cell Ranger software (10x Genomics, Pleasanton, CA) version 2.2.0 [Zheng 2017]. 


Data analysis Data were analyzed using Perl version 5.18.2, R version 3.6.0, and the following packages and versions in R for analysis: Seurat, 3.0.2 
Stuart 2019]; SingleR, 1.0.1 [Aran 2019]; RankProd, 3.11.0 [Del Carratore 2017]; GSEABase, 1.47.0; limma, 3.41.15 [Ritchie 2015]; 
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2014]; and survival, 2.44-1.1 [Therneau & Grambsch 2000]. Two-dimensional gene expression maps were generated using coordinates 
rom the Uniform Manifold Approximation and Projection (UMAP) algorithm [McInnes and Healy 2018, Becht 2019], as implemented in 
he umap-learn module version 0.3.10 in Python version 3.6.0, run under R using reticulate version 1.13. Figures were produced using 
he following packages and versions in R: colorspace, 1.4-1 [Zeileis 2019]; RColorBrewer, 1.1-2; pheatmap, 1.0.12; and superheat, 0.1.0 
Barter & Yu 2018]. External datasets were obtained using GenomicDataCommons, 1.9.0 [Morgan & Davis 2017] and GEOquery, 2.53.0 
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- Accession codes, unique identifiers, or web links for publicly available datasets 
- Alist of figures that have associated raw data 
- Adescription of any restrictions on data availability 


FASTQ files containing raw reads from the scRNA-seq and scTCR-seq analyses have been deposited with the European Genome-phenome Archive (GA) under 
studies EGASO00001003993 and EGASO0001003994, and datasets EGAD00001005464 and EGADO00001005465. These files are available under controlled access 
upon request to the Data Access Committee, with contact information provided at the corresponding Web page at EGA. Processed output files from Cell Ranger, 
integrated assay results from Seurat, and metadata with UMAP coordinates, cluster assignments, and clonotypes are available from the NCBI Gene Expression 


Omnibus database under accession GSE139555, which also provides computer code used to generate the analyses and figures in this paper as a Supplementary File. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample size of patients was determined in part by the exploratory nature of our study, the novelty of the 10x Genomics single-cell TCR-seq 
assay technology, and the relative difficulty of obtaining surgical samples. However, our study of 14 patients is comparable to published 
studies of single-cell RNA-seq and single-cell TCR-seq data, with 2018 studies having 12-14 patients using another technology (SmartSeq2). 
Furthermore, our dataset has greater sequencing depth than published studies, providing evidence that our sample sizes of cells and clones 
should be adequate for comparable analysis of T cell subtypes and clonal expansion. Sample sizes for clinical trial analyses were determined 
by the designers of the clinical trials. 


Data exclusions — Single cells from CD3- and CD45-selected samples were separated computationally into T and non-T cells, as described in Methods and 
documented in Extended Data Fig 1a-g. Patient bcc.su003 from Yost et al., 2019 showed no overlapping clones between pre- and post- 
treatment tumour, and could not be analyzed further, as described in Methods. 


Replication We tested our script for identifying clonotypes against the assignment from CellRanger software for each patient. We compared our cluster 
analysis of T cell subsets against the clusters obtained by Guo et al., 2018, Zhang et al., 2018, and Yost et al., 2019. We confirmed our 
relationship between peripheral clonal expansion and infiltration, as well as the correlation between blood clone size and tumor clone size, 
using 14 patients from Guo et al., 2018 and 12 patients from Zhang et al., 2018. We observed relationships between T cell subsets and clonal 
expansion patterns similar to ours using the same two datasets. We observed a correlation between blood transcript counts and novel 
tumour clone size on two patients in Yost et al., 2019 with bulk RNA-seq measurements in blood and the largest numbers of novel clones. 

We confirmed our hypotheses regarding the replenishment of T cells from blood using 6 patients from Yost et al., 2019. We confirmed our 
analysis of viral reactivity using two datasets: VDJdb and TCGA-derived TCR sequences. We performed survival analysis on gene signatures for 
tissue expansion patterns using three independent clinical trials. 


Randomization — Single-cell TCR data were grouped using T cell receptor chains and CDR3 nucleotide sequences assigned by the Cell Ranger software package. 
Single-cell RNA-Seq data were grouped using existing classification algorithms from the Seurat package. All two-dimensional gene expression 
maps plotted cells in random order. Random jitter was added to scatterplots to visually display all points. 


Blinding Patient samples were obtained from commercial vendors, who were blinded to the scientific purpose of the samples. Sample preparation 
was performed by laboratory personnel who were blinded to the subsequent computational analysis. Exploratory computational analyses 
were performed in an unsupervised manner, with analyses blinded to preconceived notions of the expected outcome. Statistical tests were 
performed without bias about expected results. Clinical data were obtained from randomized clinical trials that blinded both patients and 
practitioners about their treatment arms. 
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Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Unique biological materials ChIP-seq 
Antibodies Flow cytometry 
Eukaryotic cell lines MRI-based neuroimaging 
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Animals and other organisms 


Human research participants 


Unique biological materials 
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Obtaining unique materials | Samples no longer available, because they were consumed in the preparation for analysis 


Antibodies 


Antibodies used EpCAM: Biolegend 9C4 antibody, Cat# 324208, Lot# B198775 , fluorochrome APC. CD45: Biolegend 2D1 antibody, Cat# 368516, 
Lot# B264243, fluorochrome APC/Cy7. CD3: BD Biosciences SK7 antibody, Cat# 564001, Lot# 6109971, fluorochrome BUV395. 


Validation EpCAM (Biolegend 9C4 antibody): Product information: https://www.biolegend.com/en-us/search-results/apc-anti-human- 
cd326-epcam-antibody-3758 
CD45 (Biolegend 2D1 antibody): https://www. biolegend.com/en-us/search-results/apc-cy7-anti-human-cd45-antibody-12400 
CD3 (BD Biosciences SK7 antibody): https://www.bdbiosciences.com/us/applications/research/t-cell-immunology/th-1-cells/ 
surface-markers/human/buv395-mouse-anti-human-cd3-sk7-also-known-as-leu-4/p/564001 
Certificates of analysis for Biolegend lots are available at http://biolegend.com/it-it/certificate of analysis. A quality certificate 
for the BD Biosciences lot is available at http://regdocs.bd.com/regdocs/searchCOAAction.do. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics The 14 patients studied by single-cell TCR and scRNA-seq are as follows: Lung1: age 65, female. Lung2: age 60, female. Lung3: 
age 51, female. Lung4: age 67, female. Lung5: age 71, female. Lung6: age 55, female. Endo1: age 59, female. Endo2: age 62, 
female. Endo3: age 61, female. Colon1: age 47, female. Colon2: age 80, female. Renali: age 65, male. Renal2: age 45, female. 
Renal3: age 62, male. Population characteristics of clinical trial subjects are presented in their corresponding publications. 


Recruitment Patient samples were procured from third parties (Discovery Life Sciences, iSpecimen Inc., Avaden BioSciences, and TriMetis Life 
Sciences) from adult patients undergoing resection surgery. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation Fresh surgical samples were separated into tumour and NAT compartments by the reviewing pathologist and shipped overnight 
to our institution. Upon arrival, samples were rinsed with phosphate-buffered saline (PBS) until no traces of blood were visually 
detected. Subsequently, samples were digested with a combination of Collagenase D (0.5mg/ml) and DNAse (0.1mg/ml) for 
15min at 37°C with gentle shaking. Subsequently, samples were subjected to a gentleMACS dissociator (Miltenyi Biotec), 
followed by an additional 10 min incubation at 37°C. Peripheral blood mononuclear cells (PBMCs) from patient blood samples 
were isolated using 50mL Leucosep™ tubes (Greiner Bio-One International, Germany) and Ficoll-Paque™ PLUS (GE Healthcare, 
Sweden). Whole blood drawn into sodium heparin blood collection tubes were diluted 3 times with PBS without calcium or 
magnesium (Lonza, Walkersville, MD). Diluted cell suspensions were carefully layered on Leucosep tubes and centrifuged for 15 
minutes at 800 x g at room temperature (RT). Interphase-containing PBMCs were harvested and washed with PBS and 
subsequently centrifuged for 10 minutes at 250 x g at RT before further processing. 


Instrument Cells were purified by fluorescence-activated cell sorting (FACS) on a Becton Dickinson FACSAria Fusion cell sorter equipped with 


Instrument 4 lasers (405nm, 488nm, 561nm, 638nm). A 70-micron nozzle running at 70 psi and 90kHz was used as the setup for each sort 
session. 


Software FACSDiva (version 8.0.1) and FlowJo (version 10) were used to collect and analyze the flow cytometry data. 


Cell population abundance __Post-sort samples were analyzed for purity using the same FACS sorter. Using the same gating strategy , the purity of the 
samples was determined and calculated to be 97-99% 


Gating strategy Before gating on fluorescence, live, single cells were gated using FSC-A and SSC-A (for intact cells) and SSC-W/SSC-H and FSC-W/ 
FSC-H (to ensure that only singlets were sorted). FACS gates were drawn to include only live single cells based on Calcein Blue 
AM+ and 7-AAD (Thermo Fisher Scientific). Further gates were drawn to arrive at CD3+CD45+EpCAM- (for CD3+ selected 
samples) or CD45+EpCAM- cells (for CD45+ selected samples) . Boundaries between positive and negative cell fractions were 
drawn based on single-color stains. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Although itis well-established that reductions in the ratio of insulin to glucagon inthe 
portal vein have a major role in the dysregulation of hepatic glucose metabolism in 
type-2 diabetes’ ’, the mechanisms by which glucagon affects hepatic glucose 
production and mitochondrial oxidation are poorly understood. Here we show that 
glucagon stimulates hepatic gluconeogenesis by increasing the activity of hepatic 
adipose triglyceride lipase, intrahepatic lipolysis, hepatic acetyl-CoA content and 
pyruvate carboxylase flux, while also increasing mitochondrial fat oxidation—all of 
which are mediated by stimulation of the inositol triphosphate receptor 1 (INSP3R1). 
In rats and mice, chronic physiological increases in plasma glucagon concentrations 
increased mitochondrial oxidation of fat in the liver and reversed diet-induced 
hepatic steatosis and insulin resistance. However, these effects of chronic glucagon 
treatment—reversing hepatic steatosis and glucose intolerance—were abrogated in 
Insp3rl1 (also known as /tpr1)-knockout mice. These results provide insights into 
glucagon biology and suggest that INSP3R1 may represent a target for therapies that 
aim to reverse nonalcoholic fatty liver disease and type-2 diabetes. 


On the basis of the clear role for glucagon in the pathogenesis of 
diabetes, glucagon-blocking therapies have been pursued as adjuncts 
to therapies for this condition. Glucagon antagonism‘ ° and knock- 
down of the glucagon receptor” have shown glucose-lowering effects; 
however, these agents increase liver enzymes’* by anunknown mecha- 
nism. Conversely, a dual agonist for glucagon-like peptide-1 and gluca- 
gon receptor increases energy expenditure and promotes weight loss, 
an effect associated with reductions in blood glucose”. Together, these 
data suggest a role for glucagon in promoting hepatic mitochondrial 
fat oxidation, in addition to its known effect of stimulating hepatic 
gluconeogenesis. 

Glucagon stimulates hepatic glycogenolysis and gluconeogenesis, 
the latter of which is believed to occur largely through transcriptional 
regulation. Hepatic calcium signalling is integral to the transcriptional 
regulation of gluconeogenesis: inhibition or deletion of liver Ca”*/ 
calmodulin-dependent protein kinase II (CAMKII) results in reduced 
expression of proteins in the gluconeogenic pathway in mouse livers 
and associated reductions in plasma glucose and insulin concentra- 
tions’>®, Type-1 INSP3R is the isoform that is primarily responsible for 
mitochondrial calcium signalling in hepatocytes"®. It has previously 
been shown" that knocking down /nsp3rl1 reduces glucose production 
inisolated hepatocytes; however, no studies have fully characterized 
the role and mechanism of INSP3R signalling in glucagon-stimulated 
hepatic gluconeogenesis in vivo. 

To examine the potential calcium dependence of the acute response 
to glucagon, we studied mice with /nsp3rl knocked out specifically 
in the liver, along with their weight-matched wild-type littermates 


(Extended Data Fig. 1a). The /nsp3rl-knockout mice exhibited a marked 
reduction in hepatic INSP3R1 protein expression (Extended Data 
Fig. 1b; full gels for all blots are shown in Supplementary Fig. 1). We 
examined subcellular fractions in primary hepatocytes and found that 
INSP3RI1 was associated with mitochondria, whereas thetype-2 INSP3R 
isoform (INSP3R2) was not; the type-3 INSP3R isoform (INSP3R3) was 
not detected at all (Extended Data Fig. 1c), consistent with previous 
data that show that INSP3R1 is primarily responsible for mitochondrial 
calcium signals!®. We then treated mice—which were fasted overnight, 
and therefore depleted of glycogen—with an acute infusion of glucagon, 
which modestly increased plasma glucose and insulin concentrations in 
wild-type—but not /nsp3rl-knockout—mice and increased phosphoryla- 
tion of INSP3R1 (Fig. la—c, Extended Data Fig. 1c). Phosphorylation of 
CREB-regulated transcriptional coactivator 2 (CRTC2) was decreased 
with glucagon infusion in both wild-type and /nsp3rl-knockout mice 
(Extended Data Fig. 1d), consistent with previous literature. Glucagon 
increased cAMP concentrations and the activity of protein kinase A 
(PKA) in both wild-type and /nsp3rl-knockout mice, dissociating these 
mediators from glucagon-induced changes in plasma glucose con- 
centrations (Fig. 1d, e). Consistent with a previous report”, glucagon 
stimulated the phosphorylation of CAMKII but not of CAMKIV in an 
INSP3R1-dependent manner (Fig. 1f, Extended Data Fig. le). These data 
imply that INSP3R1 activation and CAMKIl activity arerequired forthe 
acute effect of glucagon in stimulating hepatic glucose production 
(HGP), which can be attributed almost entirely to gluconeogenesis in 
this hepatic glycogen-depleted state’’. Indeed, we measured increases 
of approximately 25% in in vivo HGP, which could be attributed to an 
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Fig. 1|Glucagon acutely stimulates hepatic gluconeogenesis by increasing 
hepatic acetyl-CoA content and V,,.a-c, Plasma glucose, insulin and glucagon 
concentrations before and at the end of a2-hintravenous infusion of glucagon 
(n=7).d-f, Hepatic cAMP concentrations, PKA activity and CAMKII 
phosphorylation (pCAMKII) (n=5, with the exception of wild type without (-) 
glucagon ine (n=4)). Blots inf, and in Fig. 2a, Extended Data Figs. 1b,c,e, 3f,g,4a 
were stripped and reprobed for all proteins of interest. g,h, HGP (n=6 wildtype 
and 5 knockout) and V,, (n=5 wild type and 6 knockout). i,j, Hepatic long-chain 
acyl- (n=5- glucagon, 6 with (+) glucagon) and acetyl-CoA content (n=6). Inall 
panels, mean+s.e.m. isshown. Groups were compared before and after 
glucagon treatment (a-c, gand h) by two-tailed paired Student’s t-test, and 
separate mice (+glucagonind-f, i,j, and wild-type versus knockout mice inall 
panels) were compared bytwo-tailed unpaired Student’s t-test. Alln values refer 
tonumbers of mice. 


increased rate of hepatic pyruvate carboxylase flux (V,,) that was 
associated with amino acid depletion in the liver without alterations 
in hepatic glycogen content (Fig. 1g, h, Extended Data Fig. 1f-m, Sup- 
plementary Tables 1, 2). The in vivo HGP data were mirrored by twofold 
increases in both HGP and V,, inisolated hepatocytes from wild-type 
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Fig. 2 | Glucagon requires INSP3-mediated intrahepatic lipolysis to promote 
V,-and HGP. a, Phosphorylation of ATGL at Ser4.06 (n=5). The blots here, as well 
as those in Fig. 1f, Extended Data Figs. 1b,c, e, 5f, g, 7a, were stripped and 
reprobed for all proteins ofinterest.b, Glucose productionin hepatocytes 
(n=6 mice per group) incubated with the ATGL inhibitor atglistatin and/or 
glucagon. ****P< 0.0001 versus wild type- glucagon — atglistatin, §§§§P<0.0001 
versus wild type + glucagon — atglistatin.c, d, Plasma glucose and insulin 
concentrations in mice treated with an adeno-associated virus to knock down 
Atglinaliver-specific manner. In c-h, n=6 wild type, 5 wildtype + Atgl 
knockdown and 6 knockout + Atgl knockdown. e, f, Liver long-chain acyl- and 
acetyl-CoA concentrations following a 2-h glucagon infusion. g,h, HGP and Vpc. 
Inall panels, mean+s.e.m. is shown. Groups were compared before and after 
glucagon treatment (c,d) by two-tailed paired Student’s ¢-test, and groups inall 
other panels (as well as the four separate groups inc, d) were compared by 
two-tailed unpaired Student’s ¢-test. All values refer tonumbers of mice. 


but not /nsp3rl-knockout mice incubated with glucagon (Extended Data 
Fig. 1n, 0). The ability ofinsulin to suppress HGP and V,, was unaltered 
in Insp3rl-knockout hepatocytes, and malic enzyme activity was dis- 
sociated from glucose production by treatment with a malic enzyme 
inhibitor (Extended Data Fig. 1p-s). Finally, in vivo studies with soma- 
tostatin and replacement insulin infusion to mimic basal insulin con- 
centrations in the portal vein (85-90 pM in both genotypes) revealed 
that changes in plasma insulin could not explain the observation that 
glucagon stimulated HGP and V,, only in wild-type mice (Extended 
Data Fig. 1t-x). 

To further explore the calcium-dependent mechanism by which 
glucagon stimulates HGP and V,,, we treated hepatocytes with two 
phospholipase C (PLC) antagonists (ET-18-OCH3 or U-73122) and found 
that both agents inhibited glucose production and V,, in wild-type— 
but not /nsp3rl-knockout—hepatocytes (Extended Data Fig. 2a—d). 
Similarly, a PKA inhibitor (H-89) abrogated the ability of glucagon to 


stimulate glucose production and V,, only in wild-type hepatocytes, 
which demonstrates that both the PLC and PKA pathways are required 
to activate gluconeogenesis in response to glucagon via INSP3R1 sig- 
nalling (Extended Data Fig. 2e, f). To specifically confirm the role of 
INSP3R1 in promoting HGP, we treated hepatocytes with vasopressin 
(an activator of the INSP3 receptor) and found that this agent reca- 
pitulated the effect of glucagon to increase glucose production and 
Vpc (Extended Data Fig. 2g, h). However, small-molecule inhibitors of 
INSP3 (2-aminoethoxydiphenyl borate (2-APB) and caffeine), CAMKII 
and CAMKIV (KN-93) and the SERCA pump (thapsigargin) inhibited the 
ability of glucagon to stimulate glucose production and V,, (Extended 
Data Fig. 2i-p), which suggests that maintaining a normal balance of 
calcium throughout the cell is required for normal INSP3R1 signalling. 
None of these agents had any effect in /nsp3rl-knockout mice, which 
indicates that INSP3R2 and INSP3R3 donot restore glucose production 
in the absence of INSP3R1in the liver. 

The observed increases in the rate of HGP occurred in the absence 
of any effect of glucagon to acutely increase the expression of mRNAs 
or proteins associated with hepatic gluconeogenesis in vivo. However, 
glucagon infusion increased hepatic long-chain acyl-CoA and acetyl- 
CoA content by 35-60% (Fig. 1i,j, Extended Data Fig. 3a—e). These 
increases in hepatic long-chain acyl- and acetyl-CoA concentrations 
were dissociated from the phosphorylation of both acetyl-CoA car- 
boxylase (ACC) and 5S’ AMP-activated protein kinase (AMPK), both of 
which were increased by glucagon in /nsp3r1-knockout, but not in wild- 
type, mice (Extended Data Fig. 3f, g). The differences in ACC and AMPK 
phosphorylation between genotypes probably reflect the interplay 
between the ability of glucagon to promote the phosphorylation of 
both of these enzymes and the ability of hyperinsulinaemia (which was 
observed only in wild-type mice infused with glucagon) to suppress this 
phosphorylation. Hepatic long-chain acyl-CoA and acetyl-CoA content 
increased in wild-type mice infused with glucagon despite reduced 
plasma non-esterified fatty acids (NEFA) concentrations (Extended 
Data Fig. 3h), consistent with glucagon stimulation of intrahepatic 
lipolysis but not white adipose tissue lipolysis. As acetyl-CoA is an allos- 
teric activator of pyruvate carboxylase (PC), the increase in hepatic 
acetyl-CoA content observed with glucagon infusion could explain 
theincreasesin V,,and HGP that occurred after glucagon treatmentin 
wild-type mice. These changes occurred independently of any changes 
in hepatic malonyl-CoA content (Extended Data Fig. 3i), whichindicates 
that malonyl-CoA suppression of carnitine palmitoyl transferase I was 
not responsible for glucagon stimulation of hepatic mitochondrial 
B-oxidation under these conditions. 

To further understand the physiological role of glucagon-induced 
HGP, we fasted wild-type and /nsp3r1-knockout mice for 48 h. In the 
starved state and despite 70-90% increases in plasma glucagon con- 
centrations in the tail vein and portal vein, /nsp3r1-knockout mice 
manifested lower plasma glucose and insulin concentrations that 
were associated with reductions in hepatic long-chain acyl-CoA and 
acetyl-CoA content and no change in hepatic malonyl-CoA content 
as compared to wild-type mice (Extended Data Fig. 3j-o). These data 
demonstrate a critical role for INSP3R1 in the maintenance of eugly- 
caemia during starvation. 

We next hypothesized that INSP3RI1 stimulation of intrahepatic 
lipolysis may explain the observed increases in hepatic acetyl-CoA 
contentand V,, after glucagon treatment. Consistent with this hypoth- 
esis, phosphorylation of hepatic adipose triglyceride lipase (ATGL) 
at Ser406—which has previously been shown to regulate the activity 
of ATGL”—was increased threefold by glucagon in wild-type, but not 
Insp3r1-knockout, mouse livers (Fig. 2a). By contrast, phosphorylation 
of hormone:sensitive lipase (HSL) was increased by glucagon in both 
genotypes, which dissociates HSL activity from glucagon activation of 
HGP and V,, (Extended Data Fig. 4a). In vitro studies revealed 60-100% 
increases in NEFA and glycerol production with glucagon treatment 
in wild-type, but not INSP3R1-deficient, hepatocytes (Extended 


Data Fig. 4b, c). Plasma NEFA concentrations were unchanged by gluca- 
gon infusion in mice treated with somatostatin (Extended Data Fig. 4d). 
These data suggest that although glucagon may promote lipolysis 
when infused at markedly supraphysiological doses”, in the setting of 
physiological concentrations of glucagon and intact B-cell function, 
glucagon does not directly affect white adipose tissue lipolysis” ”. 
Confirming the requirement for glucagon-stimulated intrahepatic 
lipolysis to promote gluconeogenesis, glucagon had no effect in hepato- 
cytes treated with a small molecule inhibitor of ATGL (atglistatin). 
Similarly, INSP3 agonism recapitulated the effect of glucagon in terms 
of stimulating intrahepatic lipolysis, and inhibitors of INSP3, CAMKII 
and CAMKIV, PLC, PKA and the SERCA pump inhibited the ability of 
glucagon to stimulate intrahepatic lipolysis, glucose production and 
Vpc (Fig. 2b, Extended Data Fig. 4e-n). To examine the effect of glucagon 
stimulation of intrahepatic lipolysis in vivo, we knocked down Atgl 
(also known as Pnpla2) in a liver-specific manner in wild-type mice 
and their /nsp3r1-knockout littermates (Extended Data Fig. 5a-c). Atgl 
knockdown abrogated the ability of glucagon to stimulate HGP, V,. and 
to increase long-chain acyl- and acetyl-CoA content in wild-type mice 
(Fig. 2c-h), which demonstrates a critical role for the stimulation of 
intrahepatic lipolysis in mediating the increases in each parameter that 
result from glucagon infusion. These alterations in hepatic gluconeo- 
genesis were again dissociated from changes in hepatic glycogen con- 
tent or the expression of gluconeogenic proteins in the liver. In addition, 
thealterations inthe rate of hepatic gluconeogenesis were dissociated 
from white adipose tissue lipolysis and from hepatic malonyl-CoA con- 
tent (Extended Data Fig. 5d-j). Taken together, these data demonstrate 
that glucagon acutely stimulates hepatic gluconeogenesis via INSP3R1 
by promoting intrahepatic lipolysis through the stimulation of ATGL, 
and thereby increases hepatic acetyl-CoA and gluconeogenesis via 
allosteric activation of PC. 

We hypothesized that glucagon may also stimulate hepatic mito- 
chondrial oxidation via INSP3R1. Glucagon increased both mitochon- 
drial and cytosolic calcium signalling in hepatocytes from wild-type 
mice, whereas /nsp3rl-knockout mice manifested a reduced response 
to glucagon, with some residual calcium responsiveness that was prob- 
ably attributable to the activity of INSP3R2 and INSP3R3. Vasopressin 
causeda similar increase in cytosolicand mitochondrial calcium signal- 
ling in wild-type, but not /Insp3r1-knockout, mice. However, incubation 
of wild-type hepatocytes with a PLC inhibitor completely abrogated 
the mitochondrial and cytosolic calcium responses to glucagon. PKA 
inhibition reduced the amplitude of the mitochondrial—but not the 
cytosolic—calcium response, and lowered the percentage of responding 
cells by 75% (Extended Data Fig. 6a—m). Ex vivo positional isotopomer 
nuclear magnetic resonance (NMR) tracer analysis (PINTA) revealed 
that glucagon stimulated hepatic mitochondrial oxidation (measured 
as the rate of citrate synthase flux (V,;)) in vivo, increasing V,. five- 
fold in wild-type mice; this could mostly be attributed to increased 
mitochondrial fat oxidation (Fig. 3a, b). However, these increases in 
hepatic mitochondrial oxidation in response to glucagon did not occur 
in/nsp3r1-knockout mice. Taken together, these data suggest that gluca- 
gon stimulates hepatic mitochondrial oxidation through the activation 
of INSP3R1, which results in increased levels of intramitochondrial 
calcium and-in turn—stimulation of mitochondrial dehydrogenases”. 
Consistent with this hypothesis, we observed a sevenfold increase in 
the rate of hepatic pyruvate dehydrogenase flux (V>p,,) in wild-type, but 
not /nsp3rl-knockout, mice infused with glucagon, without any differ- 
ence inthe rate of pyruvate kinase flux (V,,) (Extended Data Fig. 6n, 0). 
Glucagon stimulation of mitochondrial oxidation was not dependent 
on alterations in insulin: wild-type mice infused with somatostatin, 
basal insulin and approximately 100-pM glucagon exhibited increases 
in V., similar to those of wild-type mice not treated with somatosta- 
tin—however, again, no glucagon stimulation of V.; was observed in 
their Insp3r1- knockout littermates (Extended Data Fig. 6p). In vitro 
studies confirmed an increase in oxygen consumption with glucagon 
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Fig. 3| Chronicincreases in mitochondrial oxidation with a continuous 
3.5-week glucagon infusion reverse hepatic steatosis and improve glucose 
tolerance in an INSP3R1-dependent manner. a, b, Liver V., and the rate of fatty 
acid oxidation (V,,,) with acute glucagon infusion. n=5 wildtype - glucagon, 
6knockout - glucagon, 6 wild type + glucagon and 5 knockout + glucagon. c, 
Liver triacylglycerol (TAG) concentrations in high-fat-diet-fed mice chronically 
infused with glucagon. n=9 wildtype - glucagon, 8 wild type + glucagon, 

8 knockout - glucagon and 7 knockout + glucagon. d, Liver ceramide. n=9 wild 


stimulation in isolated hepatocytes from wild-type, but not /nsp3r1- 
knockout, mice (Extended Data Fig. 6q). These data demonstrate that 
glucagon mediates these effects in a cell-autonomous manner. 

Liver triglyceride concentrations did not differ between chow-fed 
wild-type and /nsp3r1-knockout mice (Extended Data Fig. 6r), which 
suggests that basal physiological levels of glucagon may not increase 
mitochondrial fat oxidation in ad libitum fed mice to a sufficient 
extent to alter hepatic triglyceride content. However, we hypothesized 
that the chronicincreases in hepatic mitochondrial oxidation induced 
by chronic glucagon treatment would reverse nonalcoholic fatty liver 
disease and improve whole-body insulin sensitivity. To investigate 
this, we performed chronic glucagon infusions in awake rats with diet- 
induced obesity, and found that after ten days of treatment chronic 
glucagon infusion doubled the rate of hepatic mitochondrial oxidation 
(which could be attributed to increased rates of hepatic fat oxidation) 
(Extended Data Fig. 7a-g). Two hours after the withdrawal of glucagon, 
rats exhibited lower fasting plasma glucose and insulin concentrations 
that were associated with 50-90% reductions in hepatic triglyceride and 
diacylglycerol concentrations as well as marked reductions in protein 
kinase C-e (PKCe) translocation—despite similar food intake, body 
weight and hepatic ceramide content (Extended Data Fig. 7h-o, Sup- 
plementary Tables 3, 4). Consistent with chronic increases in hepatic 
glycogenolysis, chronic glucagon infusion also resulted in reductions in 
liver glycogen content. In contrast to acute glucagon infusion, chronic 
glucagon infusion with withdrawal of glucagon treatment four hours 
before the tissues were isolated lowered both the hepatic acetyl- and 
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type - glucagon, 7 wild type + glucagon, 9 knockout - glucagon and 8 knockout 
+ glucagon.e, Liver diacylglycerol (DAG).n=9 without glucagon, 7 with 
glucagon. f, PKCe translocation (n=6). g,h, Plasma glucose and insulin during a 
glucose tolerance test. n=10 wild type without glucagon (WT vehicle), 

11 wildtype with glucagon (WT glucagon), 8 knockout (KO) vehicle, 

8 knockout glucagon). Inall panels, mean +s.e.m.is shown. *P< 0.05, **P<0.01, 
***P<(0.001. Groups were compared using two-tailed unpaired Student’s ¢-test. 
Allnvalues refer tonumbers of mice. 


malonyl-CoA content (Extended Data Fig. 7p-r); the reduction in mal- 
onyl-CoA content potentially contributes to the observed reductions 
in hepatic lipid concentrations due to suppression of hepatic lipogen- 
esis”. Consistent with their lower hepatic lipid and acetyl-CoA content, 
rats that were chronically infused with glucagon manifested improved 
glucose tolerance and insulin sensitivity (Extended Data Fig. 7s—v). 

To determine whether INSP3R1-dependent calcium signalling medi- 
ates the ability of chronic hyperglucagonaemiato reverse nonalcoholic 
fatty liver disease, we performed a four-week continuous infusion of 
glucagon in wild-type and /nsp3r1-knockout mice fed a high-fat diet. 
Despite unchanged body weight, food and water intake, and energetics, 
wild-type mice treated with glucagon exhibited 50-80% reductions 
in hepatic triglyceride and diacylglycerol (but not ceramide) content 
and in PKCe translocation that resulted in a marked improvement in 
glucose tolerance, whereas each of these parameters was unchanged 
in /nsp3r1-knockout mice (Fig. 3c—h, Extended Data Fig. 8a-j, Supple- 
mentary Tables 5, 6). 

Collectively, our studies reveal that glucagon stimulates intrahepatic 
lipolysis through an INSP3R1- and CAMKII-dependent process that 
increases hepatic acetyl-CoA content (which allosterically activates 
V, 18), and that this phenomenon explains the acute transcription- 
independent ability of glucagon to acutely stimulate gluconeogenesis 
in vivo. In addition, glucagon stimulates hepatic mitochondrial oxida- 
tion through INSP3R1-mediated calcium signalling, and this process 
can be exploited to reverse nonalcoholic fatty liver disease and hepatic 
insulin resistance with short-term continuous glucagon treatment. 


These data reveal a transcription-independent alternative mecha- 
nism (Extended Data Fig. 9) for glucagon action. Our studies provide 
evidence in support of INSP3R1as a potential target for the treatment 
of nonalcoholic fatty liver disease, nonalcoholic steatohepatitis and 
type-2 diabetes; future clinical studies will be required to examine 
this possibility. 
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Methods 


Sample size was determined based on a power calculation to detect 
moderate to large differences between groups”. The experiments 
were not formally randomized, but littermates (wild-type and knock- 
out) were usedin parallel for each experiment. Investigators were not 
blinded to allocation during experiments but were blinded during 
outcome assessment. 


Animals 

All protocols were approved by the Yale University Animal Care and Use 
Committee. Liver-specific /nsp3rl-knockout mice were generated as 
previously described”, and inall experiments littermates were studied 
at 10-12 weeks of age. Mice were fed standard chow (Teklad no. 2018) 
and housed on a12-h light/dark cycle in the Yale Animal Resources 
Center. To knock down ATGL ina liver-specific manner, an adeno- 
associated virus targeting ATGL (Vector BioLabs, 10” genome copies 
per mouse) was administered by retro-orbital injection four weeks 
before studies. Male mice were used for in vivo studies and both sexes 
were used for in vitro studies. One week before in vivo studies, mice 
underwent surgery under isoflurane anaesthesia to place a catheter 
in the jugular vein. Postsurgical recovery was demonstrated by con- 
firming that mice had regained their presurgical body weight before 
any in vivo studies. Mice were fasted overnight before in vivo stud- 
ies, unless otherwise stated. In the acute glucagon-infusion studies, 
mice were given an intravenous infusion of glucagon (5 ng per kg body 
weight per minute (kg™ min™)) for two hours, with tissue and plasma 
samples obtained after two hours of infusion. Mice were killed using 
intravenous pentobarbital at the conclusion of the terminal study. 
Inthesomatostatin infusion study, a 90-min jugular venous infusion of 
somatostatin (4 pg kg? min“, 1:1 mix of somatostatin 14 and somatosta- 
tin 28) concurrently with [3-"C]lactate and [?H,]glucose, as described 
in‘In vivo studies’, was administered. During the somatostatin infusion, 
mice were also infused with insulin (2 mU kg? min”) and glucagon 
(6 ng kg? min”). In the acute high-dose glucagon experiment, mice 
were injected with 600 pg kg? glucagon and euthanized 20 min later, 
after isoflurane anaesthesia. Blood was collected by cardiac puncture, 
and the liver was excised and freeze-clamped. 

Mice infused chronically with glucagon were fed a high-fat diet 
(Research Diets D12492) for 4 weeks, after which they were implanted 
with an Alzet pump providing glucagon continuously (0.15 ng min”) for 
another 3.5 weeks, during which time they were continued ona high-fat 
diet. Mice underwent CLAMS metabolic cage analysis to assess energet- 
ics, as well as food and water intake, during the second week of glucagon 
infusion. After an overnight fast, three weeks after implantation of the 
glucagon pumps, mice underwent an intraperitoneal glucose tolerance 
test and were subsequently re-fed. Forty-eight hours later, after a 6-h 
fast, mice were euthanized under isoflurane anaesthesia for measure- 
ment of hepatic lipid content as described in ‘Biochemical analysis’. 

For the chronic glucagon-infusion studies in rats, 300 g male 
Sprague-Dawley rats were obtained from Charles River Laboratories 
and fed asafflower-oil-based high-fat diet (60% calories from fat) (Dyets 
no. 112245) for 4 weeks. During week 3 of the diet, rats underwent sur- 
gery under isoflurane anaesthesia to place catheters in the jugular 
vein and carotid artery, and recovery was confirmed by regaining the 
presurgical body weight before in vivo studies. After four weeks ona 
high-fat diet, rats were placed in a soft plastic harness to protect their 
catheters and infused continuously for 10 days with glucagon (5ng kg 
min“, toatotal volume5 ml kg? day”). The glucagon infusion was either 
continued throughout the terminal study (PINTA) or discontinued two 
hours before the start of the terminal study (glucose tolerance test 
with hepatic lipid, acetyl-CoA or glycogen measurements), in separate 
groups of rats, as specified in the figure legends. Rats were fasted for 8h 
before being euthanized with an intravenous injection of pentobarbital. 


In vivo studies 

In all in vivo mouse studies, blood was collected from the tail vein, 
with the exception of portal-vein glucagon measurements, for which 
a needle was inserted into the portal vein of anaesthetized mice to 
collect blood. In the rat studies, blood was collected from the jugular 
venous catheter. In both species, samples were immediately centri- 
fuged (12,000 rpm) to separate plasma from red blood cells. We used 
PINTA to measure hepatic mitochondrial fluxes in both rats and mice”’. 
In brief, rodents wereinfused with a2-h primed (5 min, 3x)-continuous 
infusion of [3-°C]lactate (40 pmol kg? min”) and either [1,2,3,4,5,6, 
6°H,]glucose (0.1 mg kg? min) or [3-H] glucose (0.1 pCi kg? min”). 
At the conclusion of the study, animals were euthanized with an intra- 
venous injection of pentobarbital. 

In mice, hepatic glycogenolysis was assumed to be negligible owing 
to their prolonged (16 h) fasted state and their low hepatic glycogen 
content (Extended Data Figs. If, Si, 7p). HGP was measured by determin- 
ing the specific activity of [7H]glucose in plasma using a scintillation 
counter, and the V,,/Vj4¢p ratio was calculated using the equation 


Voc _ G2 
Vice XFE? 


in which G2 represents the [m+ 2]glucose enrichment corrected for any 
[m+ 2]glucose synthesized from °C,-labelled trioses: corrected [m+ 2] 
glucose = G2 = measured [m + 2] glucose — 2 x C4C5C6[m + 2 ]glucose, 
and XFE represents the fractional triose enrichment: 


1 
GL 
2x G2 


XFE = 
1+ 


in which G1 represents the measured [m + I]glucose, and G2 is as 
described above. To calculate absolute V,,, we multiplied the measured 
HGP bythe ratio V,,/Vycp. The ratio of hepatic V,./Vc, was calculated as 


Voc __[5-C]glucose _ 
Ves 2x[4-PC]glucose 


and the absolute V, was calculated by dividing V,. by V,,/V¢.. The deri- 
vations of each equation are described in detail in a previous study”. 
We corrected for the possible contribution of [°C]bicarbonate to label 
the TCA cycle, as previously described?*. 

We measured the ratio 


Voon _ [4-2 C]glutamate 
Ves [3 -C]alanine 


and calculated the absolute V,,,, by multiplying this ratio by the meas- 
ured V,,. Finally, the ratio of V,,—assuming minimal malic enzyme flux— 
to (Vpc + Vppy) was calculated as (ref.?’) 


Vox ___ [2-8 C]alanine 
VoctVopn [5-C]glucose 


Absolute V,, rates were then determined by multiplying Vp,/(Vpe+ Vepy) 
by the sum of V,, and Vpp,,. AS has previously been described”’, Vp-/Vcs 
can be expanded to account for pyruvate recycling(ref. 7”): 


Voc + 5Vox __ [5-¥C]glucose _ 
Ves 2x [4 PC]glucose 


Inthis study, we measured a maximum V,,/(Vpc + Vppy) of 0.4, indicating 
that the maximal Vpx/Vpc is 0.4. A Vox/Vpc at this maximal value would 
generate a 17% underestimation of V,./Vcs. Ex vivo NMR analysis was 


used to confirm PINTA measurements of flux ratios as has previously 
been described’’. 

In rats, HGP was measured by measuring the plasma glucose [m+ 7] 
atom per cent enrichment (APE) by gas chromatography-mass spec- 
trometry (GC-MS) and using these data to calculate HGP according 
to the equation HGP = (tracer APE/plasma APE - 1) x infusion rate. All 
other flux ratios and absolute fluxes were measured using the equa- 
tions given above. 

In the glucose tolerance tests, rodents wereinjected with1g kg?50% 
dextrose (rats) or 10% dextrose (mice) intraperitoneally. Blood sam- 
ples were taken through the venous catheter (rats) or by tail bleeding 
(mice) for measurement of plasma glucose and insulin concentrations 
as described in ‘Biochemical analysis’. 


Biochemical analysis 

Glucose concentrations in plasma and incell-culture media were meas- 
ured using the YSI Glucose Analyzer. Plasma insulin was measured by 
enzyme-linked immunosorbent assay (ELISA) (Mercodia), and glucagon 
was measured in samples immediately spiked with aprotinin (0.5 mg 
ul) whole blood by radioimmunoassay by the Yale Diabetes Research 
Core. Plasma NEFA concentrations were measured using the Wako NEFA 
assay, and plasma glycerol® and plasma and liver amino acid concen- 
trations” were measured by GC-MS. Liver acetyl- and malonyl-CoA® 
(standard curve R?= 0.999 and 0.999), long-chain acyl-CoA”, DAG and 
ceramide” concentrations were measured by liquid chromatography 
withtandem mass spectrometry, hepatic glycogen content was meas- 
ured following amyloglucosidase digestion” and TAG concentrations 
were measured enzymatically. cAMP concentrations were measured 
using the Enzo Life Sciences Direct cAMP ELISA. Protein concentra- 
tions were measured by western blot, using antibodies from Santa 
Cruz (PC, PEPCK and CAMKII), Cell Signaling (ATGL, pCAMKIV, CAM- 
KIV, pCRTC2, CRTC2, pACC, ACC, pAMPK, AMPK, pHSL, HSL, GAPDH 
and B-actin), Novus Biologicals (pCAMKII), Abcam (pATGL), LSBio 
(pINSP3R1) and BD Transduction Laboratories (INSP3R3 and PKCe). The 
antibody to pATGL was kindly provided by H. S. Sul, and the antibody 
to total INSP3R1 was custom-made using an epitope against the last 
18 amino acids of INSP3RI1. The antibody to INSP3R2 was generously 
donated by the Wojcikiewicz Laboratory. The antibody to CRTC2 as 
well as hepatocyte control samples (wild-type + glucagon and Crtc2 
knockout) for antibody validation were a kind gift of M. Montminy. 
To measure protein expression in isolated hepatocytes, a Mitochon- 
dria Isolation Kit (Thermo Scientific) was used to separate cells into 
cytoplasmic and crude mitochondrial fractions, the latter containing 
both mitochondria and closely apposed mitochondria-associated 
proteins. The final spin was conducted at 1,000 rpm for 15 min at 4 °C 
to obtain a cleaner mitochondrial fraction with reduced lysosomal 
contamination. Mitochondria were lysed in 2% CHAPS in tris-buffered 
saline (TBS), and the protein concentration was measured with the 
bicinchoninic acid assay (Thermo Scientific). Samples for all western 
blot analyses were loaded on 4 to 12% gradient gels (NuPage gels, Life 
Technologies) and run with MOPS buffer. Protein was transferred to 
polyvinylidene difluoride membranes by wet transfer. After blocking 
with 5% milkin TBS with 0.1% Tween 20, primary antibodies were applied 
overnight. Gluconeogenic gene mRNA expression was measured by 
quantitative PCR* using primers with the following sequences: Actb: 
forward (F), CCAGATCATGTTTGAGACCTTC, reverse (R), CATGAGGTAG 
TCTGTCAGGTCC; Pepck (also knownas Pck1): F, CAGGAAGTGAGGAAGTT 
TGTGG, R, ATGACACCCTCCTCCTGCAT; G6Pase (also knownas G6pc): 
F, GAAGGCCAAGAGATGGTGTGA, R, TGCAGCTCTTGCGGTACATG; and 
Pc(also known as Pcx): F, AGATGCACT TCCATCCCAAG, R, CCTTGGTCAC 
GTGAACCTTT 


In vitro studies 


Primary hepatocytes were isolated by the Yale Liver Center from wild- 
type and /nsp3r1-knockout mice, and plated on glass coverslips coated 


with rat collagen type I. Cells were loaded with the cytosolic Ca”* indi- 
cator dye Fluo-4 AM (Thermo Fisher Scientific), or the mitochondrial- 
matrix-targeted Ca”* indicator Rhod-2 AM (Thermo Fisher Scientific), 
for 30 min at 37 °C. Ca”* imaging experiments were performed in 
4-(2-hydroxyethyl)-1-piperazine ethanesulfonic acid (HEPES)-buffered 
balanced salt solution (25 mM HEPES, 121 mM NaCl, 4.7 mM KCI, 1.2mM 
MgSO,,1.2mM KPO,,5mM NaHCO,, 2.0 mM CaCl,, 10 mM glucose, pH 
7.4) with or without the PKA inhibitor H-89 (25 1M) or the PLC inhibi- 
tor U73122 (10 uM) 2-4 h following the initial plating. Coverslips were 
transferredtoa custom-built perfusion chamber on the stage of a Zeiss 
LSM 710 confocal microscope (Carl Zeiss Microscopy). Cytosolic and 
mitochondrial signals were monitored in Fluo-4 AM and Rhod-2-AM- 
loaded cells, respectively, using stimulation with 100-nM glucagon 
(Sigma-Aldrich) or 100-nM vasopressin (Sigma-Aldrich) with a 20x 
objective lens. Changes in fluorescence were normalized tothe initial 
fluorescence before addition of agonist and were expressed as F/Fo. 

Forin vitro glucose production and lipolysis studies, primary mouse 
hepatocytes were isolated by the Yale Liver Center. Following removal 
of cell debris by Percoll density gradient centrifugation, cells were 
plated on 6-well collagen-I-coated dishes (4.0 x 10° cells per well) in 
2 ml recovery medium (DMEM high glucose containing 10% FBS, 2% 
penicillin-streptomycin, 100 nM dexamethasone, 1nM insulin and10 
mMHEPES). After incubation for 6 hat 37 °C and 5% CO,, the attached 
cells were washed once in 1x PBS and then incubated overnight in 2 
ml low-glucose culture medium (DMEM low glucose supplemented 
with 10% FBS, 2% penicillin-streptomycin and 10 mM HEPES) for glu- 
cose production studies, or serum-free low-glucose culture medium 
(DMEM low glucose supplemented with 0.5% fatty acid free BSA, 2% 
penicillin-streptomycin and 10 mM HEPES) for lipolysis assays. The 
next morning, cells were washed twice in 1x PBS and culture media 
replaced with 2 ml glucose production medium (DMEM base medium 
supplemented with 0.5% fatty acid free BSA, 20 mM sodium lactate, 
2 mM sodium pyruvate and 10 mM HEPES, pH 7.4) or serum-free low 
glucose culture medium (lipolysis assay) in the presence of 100 nM 
glucagon or vehicle control. After incubation for 8 h at 37 °C and 5% 
CO,, cell culture medium was collected for analysis of glucose, NEFA 
and glycerol concentrations as described in ‘Biochemical analysis’. 
Vp was determined by measuring V,,/V;¢p by PINTA as described in ‘In 
vivo studies’ using 300 pl of the collected medium and multiplying this 
ratio by the measured glucose production rate. In a subset of studies, 
cells were incubated for 8 h during the glucose production assay in 
medium containing one of the following agents (all dissolved in 0.5% 
DMSO), or 0.5% DMSO vehicle: 20 1M ET-18-OCH, (Santa Cruz), 100 
uM U-73122 (Sigma), 30 nM thapsigargin (Sigma), 100 nM vasopres- 
sin (Sigma), 50 1M 2-APB (R&D Systems), 70 uM caffeine (Sigma) or 
1mMmalic enzyme inhibitor hydroxymalonate (Sigma). Toinhibit PKA, 
hepatocytes were incubated in H-89 dichloroacetate hydrate (Sigma, 
25 uM) dissolved in medium, or medium lacking H-89 as a control. Inthe 
insulin treatment studies, hepatocytes were incubated in 1nMinsulin 
with 5% BSA and glucose production medium for the duration of the 
glucose production assay. In thein vitro atglistatin study, hepatocytes 
were incubated in 10 pM atglistatin in 0.1% EtOH, or 0.1% EtOH vehicle. 
All values were normalized to total protein content determined from 
whole-cell lysates by a BCA protein assay (Thermo Fisher Scientific) 
according to the manufacturer’s instructions and expressed as fold 
change versus vehicle-treated cells. 

To assess oxygen consumption, primary mouse hepatocytes were 
isolated by the Yale Liver Center and plated on collagen-I-coated XF24 
V7 cell culture plates (1.2 x 10‘ cells per well) in2 ml recovery medium, 
as previously described®. Following incubation for 6 hat 37 °Cand 5% 
CO,, the cells were washed twice with 1 ml low-glucose culture medium 
and incubated overnight in 250 pl low-glucose culture medium at 37 °C 
and 5% CO,. The following morning, cells were washed with XF24 assay 
medium (DMEM base medium containing 1.0 mM pyruvate, 2mM glu- 
tamine and 5.5 mM glucose, pH 7.4). Five hundred microlitres of XF24 
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assay medium was added to each well and plates were equilibrated at 
37 °C for 1h. Four measurements of basal oxygen consumption rates 
(picomoles per minute) were recorded on a Seahorse Bioscience XFe 
24 Analyzer (Seahorse Biosciences) using an instrument protocol of 
3-min mix, 2-min wait and 3-min measure. After baseline measurements, 
glucagon (or vehicle) was injected at a final concentration of 1OO nM 
and oxygen consumption was recorded using the same instrument 
protocol. Ten measurements were taken following injection and the 
average of eight measurements was used for subsequent analyses. 
Oxygen consumption rates were normalized to total protein content 
and expressed as fold change compared to vehicle-treated cells. 


Statistics and reproducibility 

Group sizes were chosen to detect moderate-to-large (>40%) differ- 
ences with approximately 40% standard deviations. Comparisons were 
performed using GraphPad Prism 7. Each in vivo experiment was per- 
formed with the number of replicates specified in the figure legends. 
All data obtained are shown, with the exception of several western blots 
in which case one representative image is shown; all data are shownin 
the quantification. The two-tailed paired (when comparing the same 
animals under multiple conditions) or unpaired Student’s t-test (when 
comparing different animals) was used to compare datasets. P values 
less than 0.05 were considered significant. Comparisons were per- 
formed using GraphPad Prism 7. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| Glucagon acutely stimulates hepatic gluconeogenesis 
by increasing hepatic acetyl-CoA content and PC flux. a, Body weight (n=11). 
b, Expression of INSP3RI1 protein in the liver (n=5). Blotsinb, c, eand Figs. If, 
2a, Extended Data Figs. 3f, g,4a were stripped and reprobed for all proteins of 
interest. *P< 0.05 versus /nsp3rl-knockout mice not treated with glucagon. 

c, INSP3R protein expression incytosolic (c.) and crude mitochondrial (c.m.) 
fractions from primary hepatocytes, in which VDAC was examined as a marker 
for mitochondrial protein content, and calreticulin as a marker for non- 
mitochondria-associated membrane protein. Right, phosphorylation of 
INSP3R1in the liver of mice infused with glucagon (n=5).d, Phosphorylation of 
CRTC2 in the liver (n=5). The CRTC2 phosphorylation gel was stripped and 
reprobed to assess HSP90 (loading control). e, Phosphorylation of CAMKIV in 
the liver, with or without a2-h acute infusion of glucagon (n=5).f, Liver 
glycogencontent (n=5 wild type - glucagon, otherwise n=6). No differences 
were observed using one-way analysis of variance with Bonferroni’s multiple 
comparisons test. g-i, Plasma [m+ 1], [m+2]and [m+7]glucose enrichment 
during a 2-hinfusion of [3-°C]lactate and [?H,]glucose (n=5 wild type and 

6 knockout, with the exception ofi, for whichn=4 wild type + glucagonat 

100 and 110 min).j, k, Plasma total amino acid and alanine concentrations 

(n=5 wild type and 6 knockout). Inj, k, groups were compared before and after 


glucagon treatment by two-tailed paired Student’s t-test, and genotypes were 
compared by two-tailed unpaired Student’s ¢-test. I, m, Liver total amino acid 
and alanine concentrations (n=5).n, 0, In vitro glucose production (n=9) and 
Vpc (n=4) inisolated hepatocytes. p, q, In vitro glucose production (n=9) and 
Vp¢ (n=4) in isolated hepatocytes with and without 150 pM insulin. Basal data 
(no insulin) are duplicated from nando.r,s, In vitro glucose production (n=8) 
and V,, (n=3) inisolated hepatocytes with and without a malic enzyme (ME) 
inhibitor. **P< 0.01, ****P< 0.0001 versus wild type - glucagon - ME inhibitor. 
t-v, Plasmaconnecting peptide, glucagon and glucose concentrations in mice 
(n=6wild type and 7 knockout) treated with somatostatin, basal insulin and 
glucagon. Comparisons before and after glucagon treatment used atwo-tailed 
paired Student’s t-test. w, x, Endogenous glucose productionand Vp. 

(n= 6 wild type and 7 knockout). Inall panels, comparisons with and without 
glucagon, insulin or malic enzyme inhibitor, and wild type versus knockout 
comparisons were performed using a two-tailed unpaired Student’s ¢-test, 
unless otherwise stated. In all panels in which comparisons were performed 
(thatis, all panels with the exception of g, i), ifno Pvalue is shown, groups were 
not significantly different. Inall panels, mean+s.e.m. isshown. Alln values 
refertonumbers of mice. 


3 

p38 FI _ ; 

58 B lg i 3 9 2 
Glucagon = = + + Glucagon : : + + 
ET-18-OCH, = + - + ET-18-OCH, + - + 

e f . re 
4 _ 
P=0.002 5° 
3 7 =.3 
os ° 8 
g | ge 
§2 2 *3 ? 
os £ 
14 ¢ 14 
0 | 04 

Glucagon > + + Glucagon 

H-89 - + + H-89 
. . 

] P=0.02 J P=0.0001 

3 = * 
| i 
i a 2 a 24 
g 8 of £8 
22 - 2 9 J - o 58 
3 J22 wa a E lacy ie ea 
0+, 7 . y oO S S + fy 

Glucagon - = + + Glucagon - - + + 

2-APB - + re + 2-APB = + . + 
P=0.0008 
28 P<0,0001 7 fra 
] pees } 
20! el a is 
3 g 3 : ; 

Be 15) ° a | = are 
ie Pee 0 8, 7% o pe Bo 82 
BS 10/8 F: D ES iea ; 

oom, e . . o =, —_ Hl . 
Glucagon - - + + Glucagon - = + + 
KN-93 - 7 - + KN-93 - + - + 


Extended Data Fig. 2| Glucagon-stimulated glucose production requires 
activation of the PLC and PKA pathways, converging to activate INSP3 
signalling. a—p, In vitro glucose production and V,,inisolated hepatocytes 
with and without ET-18-OCH, (n= 3) (a, b), U-73122 (n=3, except for 
knockout — glucagon-U-73122 in d (n=2)) (c, d), H-89 (n= 6) (e, f), vasopressin 
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Extended Data Fig. 3 | Glucagon stimulates HGP independently of 
transcriptional regulation, and hasakey role in the maintenance of blood 
glucose levels during a prolonged fast. a—c, Liver Pc, Pepck and G6Pase mRNA 
expression (n=5).d,e, Liver PC and PEPCK protein (n=5, except for 

knockout + glucagon (n=6)).f, g, Liver pACC/ACC and pAMPK/AMPK ratios 
(n=5).Blotsinf, gand Figs. 1f, 2a, Extended Data Figs. 1b, c, e, 4a were stripped 
and reprobed for all proteins of interest. h, Plasma NEFA (n=5, except for 


knockout + glucagon (n=6)).i, Liver malonyl-CoA (n=6).j-I, Plasma glucose, 
insulin and glucagon concentrations in mice fasted for 48 h (n=5).m-o, Liver 
long-chain-, acetyl- and malonyl-CoA content (n=5).Inall panels, genotypes 
and groups + glucagon were compared using a two-tailed unpaired Student’s 
t-test. If no statistical comparison is denoted, the groups were not significantly 
different. In all panels, mean +s.e.m. is shown. Alln values refer to numbers of 
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Extended Data Fig. 4 | Glucagon acutely stimulates gluconeogenesis by 
activating intrahepatic, but not white adipose tissue, lipolysis. a, Liver HSL 
phosphorylation (n=5). These blots, and blots in Figs. 1f, 2a, Extended Data 
Figs. 1b,c, e, 3f, g, were stripped and reprobed for all proteins of interest. Ina-c, 
groups were compared using a two-tailed unpaired Student’s ¢-test. b,c, Invitro 
NEFA (n=14 wild type and 15 knockout) and glycerol production (n=5 wild type 
and 6 knockout) from isolated hepatocytes. d, Plasma NEFA concentrations in 
mice treated with somatostatin, replacement basal insulin and glucagon 
(n=6wildtypeand 7 knockout). No significant differences were observed 
between genotypes using a two-tailed unpaired Student t-test, or before versus 
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Extended Data Fig. 5 | Glucagon requires INSP3-mediated intrahepatic 
lipolysis to promote V,, and hepatic gluconeogenesis. a, Body weightin mice 
treated with an adeno-associated virus to knock down liver Atgl (n=6). 

b, Representative western blots. Blots from the same tissue (liver or white 
adipose tissue (WAT)) were stripped and reprobed for all proteins shown. 

c, White adipose tissue and liver ATGL protein expression. Inc-e,n=6, except 
for wild type + Atg/ knockdown and knockout (n=4).d, e, Hepatic PC and PEPCK 
protein expression. f-h, Plasma glucagon (n=6, except for wild type + Atgl 
knockdown (n=5), NEFA (n= 6, except for wild type + Atg/ knockdown and 
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knockout (n=5), and glycerol concentrations (n= 6, except for wildtype 

+ Atgl knockdown (n=5) in mice treated with an adeno-associated virus to 
knock down ATGL ina liver-specific manner. Groups were compared before and 
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is denoted, the groups were not significantly different. Error bars represent 
s.e.m. All nvalues refer to numbers of mice. 
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Extended Data Fig. 6 | Glucagon stimulates mitochondrial oxidation 
through hepatocellular calcium signalling. a, Representative mitochondrial 
response to glucagon, which was added where denoted by the ‘glucagon’ bar. 
b, Maximum mitochondrial responseto glucagon (n =324 wild type and 

167 knockout). Groups inb, d, fand h were compared by two-tailed unpaired 
Student’s t-test. c, Representative cytosolic response to glucagon. d, Maximum 
cytosolic response to glucagon (n= 146 wildtype and 175 knockout). 

e, Representative mitochondrial response to the INSP3R agonist vasopressin. 
f, Amplitude of the mitochondrial response to vasopressin (n= 73 wildtype and 
42 knockout). g, Representative cytosolic response to vasopressin. 

h, Amplitude of the cytosolic response to vasopressin (n=119 wildtype and 

53 knockout). i, Representative mitochondrial responses to glucagon (added 
where denoted by the bar) in wild-type hepatocytes incubated inthe PKA 
inhibitor H-89 or the PLC inhibitor U-73122.j, Amplitude of the mitochondrial 
response (n=79 control, 39 H-89 and 127 U-73122). Groups were compared to 
the control inj, I, musing a two-tailed unpaired Student’s t-test. 


k, Representative cytosolic responses to glucagon + H-89 or U-73122. 

I, Amplitude of the cytosolic response (n=182 control, 27 H-89 and 

187 U-73122). m, Percentage of cells with a cytosolic response (>110% baseline) 
to glucagon+ H-89 or U-73122 (n=11 control, 8 H-89 and 3 U-73122). 

n,o, Hepatic V,p, and V,, in vivo (n=5 wild type - glucagon, 

6wild type + glucagon, 6 knockout — glucagon and 5 knockout + glucagon). 
Inn, 0, groups were compared (with versus without glucagon treament, and 
wild type versus knockout) using a two-tailed unpaired Student's t-test. p, Liver 
V.;in mice infused with somatostatin, basal insulin and glucagon (n=5). 

q, In vitro oxygen consumption inisolated hepatocytes incubated + 100nM 
glucagon (n=113 wild type - glucagon, 144 knockout - glucagon, 

149 wild type + glucagon and 210 knockout + glucagon). r, Liver triglyceride 
content (without glucagon infusion) (n=11). If no statistical comparison is 
denoted, the groups were not significantly different. Error bars represent 
s.e.m. All n values refer to numbers of mice. 
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Extended Data Fig. 7 | Chronic increases in mitochondrial oxidation witha 
ten-day glucagoninfusion lead to reversal of nonalcoholic fatty liver 
disease and improvements in glucose tolerance. a, Plasma glucagon 
concentrations on the last day of infusion (n=6ina-g).b, Hepatic V,.. Inb-g, 
measurements were performed while the glucagon infusion continued. 

€,d, Voc/Vecpand Vp-/Vcs ratios. e-g, Hepatic Vpp,, Veg and V,,.h, Food intake 
during the glucagon infusion (n=8 control and 10 glucagon), determined twice 
during the ten-day infusion (on day 4 and day 9) by weighing the foodinthe 
cage; the data points are the averages of the two food-intake measurements for 
eachrat.i, Body weight after ten days of glucagon or saline infusion (n=6). 

j,k, Plasma glucose and insulin concentrations (n= 6) measured two hours 
after cessation of the glucagon infusion, after a six-hour fast. I-n, Liver TAG 
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(n=6), DAG (n=8) and ceramide (n=8) concentrations. Ina-k, n= 6.0, Hepatic 
PKCe translocation (n=6 controland 7 glucagon). p-r, Liver glycogen (n=6), 
acetyl-CoA (n=8) and malonyl-CoA content (n=8).s,t, Plasma glucose 
concentrations and area under the curve (AUC) during an intraperitoneal 
glucose tolerance test, which began two hours after completing a ten-day 
continuous infusion of glucagon or saline (n=6).Ins,u, *P<0.05,**P<0.01, 
***P< 0.001. Data aremean+s.e.m. u, V, Plasma insulin and insulin area under 
the curve during the glucose tolerance test. Inall panels, error bars represent 
s.e.m, and groups were compared using a two-tailed unpaired Student’s t-test. 
Ifno statistical comparisonis denoted, the groupsare not significantly 
different. Alln values refer to numbers of rats. 
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Extended Data Fig.8 | Chronic glucagon treatment reverses nonalcoholic 
fatty liver disease and glucose intolerance in wild-type, but not Insp3r1- 
knockout, mice. a, Body weight (n=10 wild type - glucagon, 11 wild type+ 
glucagon, 8 knockout - glucagonand 8 knockout + glucagon). b,c, Food and 
water intake. In b-g,n=8 wild type- glucagon, 9 wild type + glucagon, 

7 knockout — glucagon and 8 knockout + glucagon. d, Energy expenditure. 

e, f, Oxygen consumption and carbon dioxide production. g, Activity. h, Plasma 
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NEFA. Inh-j,n=10 wild type - glucagon, 11 wild type + glucagon, 8 knockout -— 
glucagon and 8 knockout + glucagon. i,j, Glucose and insulin area under the 
curve during an intraperitoneal glucose tolerance test. In all panels, 
mean+s.e.m. is shown. Statistical comparisons were performed using a two- 
tailed unpaired Student's ¢-test. If no statistical comparison is denoted, the 
groups were notsignificantly different. All n values refer tonumbers of mice. 
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Extended Data Fig. 9 | Schematic summary. Mechanism by which glucagon acutely stimulates intrahepatic lipolysis, hepatic gluconeogenesis and hepatic 
mitochondrial oxidation. 
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Cancer recurrence after surgery remains an unresolved clinical problem’ °. Myeloid 
cells derived from bone marrow contribute to the formation of the premetastatic 
microenvironment, which is required for disseminating tumour cells to engraft 
distant sites* *. There are currently no effective interventions that prevent the 
formation of the premetastatic microenvironment®’. Here we show that, after surgical 
removal of primary lung, breast and oesophageal cancers, low-dose adjuvant 
epigenetic therapy disrupts the premetastatic microenvironment and inhibits both 
the formation and growth of lung metastases through its selective effect on myeloid- 
derived suppressor cells (MDSCs). In mouse models of pulmonary metastases, 
MDSCs are key factors in the formation of the premetastatic microenvironment after 
resection of primary tumours. Adjuvant epigenetic therapy that uses low-dose DNA 
methyltransferase and histone deacetylase inhibitors, 5-azacytidine and entinostat, 
disrupts the premetastatic niche by inhibiting the trafficking of MDSCs through the 
downregulation of CCR2 and CXCR2, and by promoting MDSC differentiation into a 


more-interstitial macrophage-like phenotype. A decreased accumulation of MDSCs in 
the premetastatic lung produces longer periods of disease-free survival and increased 
overall survival, compared with chemotherapy. Our data demonstrate that, even after 
removal of the primary tumour, MDSCs contribute to the development of 
premetastatic niches and settlement of residual tumour cells. A combination of 
low-dose adjuvant epigenetic modifiers that disrupts this premetastatic 
microenvironment and inhibits metastases may permit an adjuvant approach to 


cancer therapy. 


Despite advances in traditional cancer treatments and newly developed 
immunotherapies, cancer recurrence after surgery of non-small-cell 
lung cancer (NSCLC), oesophageal cancer and breast cancer remains 
high at 35-76%2°, 30-66%" and 20-66%’, respectively. Most cancer- 
related mortalities after resection are due to metastases”. In early- 
stage NSCLC in particular, 30-55% of patients die from recurrent 
metastatic disease after surgery with curative intent®”, and standard 
adjuvant chemotherapy confers an absolute 5-year survival benefit of 
only 3-10%". Current immune-directed neoadjuvant therapies using 
immune checkpoint blockade rely on the presence of the primary 
tumour to generate tumour-antigen-specific T cell responses”, but 
in clinical practice the majority of early-stage NSCLC tumours are 
removed without neoadjuvant intervention””’. Novel strategies are 


needed that decrease tumour recurrence and metastasesin the absence 
of a primary tumour. 


MDSCs delineate premetastatic niches 

In 2011, we initiated a randomized, phase-II adjuvant clinical trial using 
low-dose 5-azacytidine and entinostat in patients with stage-I (T1-2aNO) 
NSCLC. Thetrial was prematurely terminated after 13 patients enrolled, 
owing to the requirement for 5-azacytidine to be given in an outpa- 
tient clinic. Nevertheless, patients tolerated the therapy well witha 
low rate of postsurgical recurrence (14.3% versus 33.3%), which sug- 
gested that epigenetic therapy may decrease relapses after curative 
surgery (Extended Data Fig. la—c, Supplementary Table 1). A previous 


A list of affiliations appears at the end of the paper. 
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Fig. 1| Low-dose AET disrupts the lung premetastatic microenvironment by 
affecting MDSCs. a, Timeline of mouse models of metastasis with treatment 
schedules. CT, computed tomography. b, Longitudinal H&E and 
immunofluorescence staining of lung tissue showed the presence of tumour 
cells from day 6 inthe LLC model. Immunofluorescence staining was 
performed using GFP (green) antibodies. Merged images contain 
4’,6-diamidino-2-phenylindole (DAPI) DNA staining, which demarcates cell 
nuclei (blue). Scale bars, 2mm. Graph shows the area and numbers of 
metastatic nodules (n=3 mice at each time point). Two-sample, two-sided t- 
test. c, Immune-cell profiles of lungs in the LLC model. Single-cell suspensions 
from both lungs were analysed by fluorescence-activated cell sorting (FACS) 
(n=3 miceat each time point). NC, negative control (normal lungs from 
C57BL/6 mice). D, day. Two-sample, two-sided ¢-test was used in comparison 
with the negative control. T,,, cells, regulatory T cells.d, FACS showing lung 
MDSCsintheLLC modelat day 3 were depleted using pep-H6 (leftmost two 


study found that combined epigenetic modifiers blunt metastases ina 
mouse model of aggressive NSCLC with an immune-competent micro- 
environment“. Here we reveal a key mechanism of action of adjuvant 
epigenetic therapy (AET) on premetastatic niches that could underlie 
these previous findings. Low-dose AET modulates innate immune fac- 
torsin the lung microenvironment to inhibit tumour recurrence after 
resection in three syngeneic models of aggressive pulmonary metas- 
tasis: Lewis lung carcinoma (LLC), HNMOO7 oesophageal squamous 
cell carcinoma (mice of the LLC and HNMOO7 models have no extrapul- 
monary metastases) and 4T1 mammary cancer (Fig. 1a, Extended 
Data Fig. 2a-e). 

It is known that the microenvironments of distant organs—the 
targets of future metastases—are not passive receivers of 


panels). Column diagram showing the effect of pep-H6 on the percentages of 
lung MDSCs and macrophages (middle two panels) at day 3 (n=3 micein each 
group). Two-sample, two-sided t-test. Kaplan-Meier curves showing disease- 
free survival and overall survival of LLC mice after MDSC depletion (rightmost 
two panels). Two-sided log-rank test. Pep-irrel, irrelevant peptibody, used as 
control. e, Kaplan-Meier curves showing disease-free and overall survival of 
LLC mice after transfusion of lung MDSCs (5 x 10°) (leftmost two panels) and of 
monocytic (M-)MDSCs (5 x 10°) or polymorphonuclear (PMN-)MDSCs (5 x 10°) 
from bone marrow (rightmost two panels). All transfusions were conducted on 
day land day 4. Two-sided log-rank test. f, FACS showing representative effects 
of low-dose (LD-)AET on lung MDSCs in LLC and HNMO07 miceat day 3. Column 
diagrams showing the effect of low-dose AET on lung MDSCs in LLC and 
HNMOO7 mice (n=3 mice at each time point). Two-sample, two-sided ¢-test. 
Bars showmean +s.e.m. NS, not significant, *P< 0.05, **P< 0.01, ***P< 0.001. 


circulating tumour cells, but instead are selectively modified by the 
primary tumour before metastasis®”>"’. Settlement of tumour cells at 
distant sites is dependent on tumour-secreted factors and tumour- 
shed extracellular vesicles that enable the premetastatic microenviron- 
ment to support their colonization®”>"®, To study whether low-dose 
AET affects the dynamics of the premetastatic microenvironment, we 
used mouse models of highly aggressive, pulmonary metastasis that 
exhibit pulmonary metastases in 90-100% of mice after resection, 
with median disease-free survival times of fewer than 14 days (Fig. 1a, 
b, Extended Data Fig. 2a-d). 

In mice of the LLC model (hereafter LLC mice), no tumour-cell infiltra- 
tion of the lung was detectable by haematoxylin and eosin (H&E) and 
no green fluorescent protein (GFP)* tumour cells were observed by 
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Fig. 2|Low-dose AET inhibits migration of monocytic MDSCs from the bone 
marrowtothelung premetastatic microenvironment by downregulating 
expression of CCR2.a, b, The effect of low-dose AET on monocytic MDSCs 
transferred from CD45.1to CD45.2 mice (a) and the trafficking ability of 
adoptively transferred monocytic MDSCs from low-dose-AET- or vehicle- 
treated CD45.1 mice in CD45.2 recipient mice (b). FACS and graphs showing 
percentages and absolute numbers of the donor-derived-cell subset (CD45.1° 
cells) in both lungs of the recipient LLC mice (n=3 mice per group).c, Cell- 
sorting schema for collecting monocytic MDSCs on day 3 after resection (left). 
Volcano plots showing differences in RNA expression of monocytic MDSCs 
from the bone marrow (BM) or lung between low-dose-AET- and vehicle-treated 
LLC mice (right) (n=3 biological replicates). d, Agilent cDNA array expression 
of chemokines and chemokine receptors in monocytic MDSCs from bone 
marrow or lungs of low-dose-AET- and vehicle-treated LLC mice (n=3 biological 
replicates). False-discovery-rate (FDR)-adjusted Pvalues of CCR2 are 0.005 
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~Ccr2 KO (n = 10) ~Ccr2 KO (n = 10) 


(bone marrow) and 0.016 (lung). Cxcr7is also known as Ackr3. e, Effect of 
low-dose AET on Ccr2 expression in monocytic MDSCs from bone marrow 
(BM-M-MDSCs) from LLC mice on day 3, by quantitative PCR (left) 

(n=3 biological replicates) and FACS (right). In the FACS plot, red represents 
low-dose AET and blue represents mock treatment. f, Transwell migration 
assay of sorted monocytic MDSCs from bone marrow of low-dose-AET- or 
vehicle-treated LLC mice (day 3) induced by CCL2 for 60 min. Fold changes are 
normalized to the migration of the cells in the unstimulated mock group (set 

at 1) (n=3 biological replicates). g, Left, FACS showing representative results of 
lung MDSCs in Ccr2-knockout (KO) and wild-type (WT) LLC and HNMOO7 mice 
at day 3. Right, Kaplan-Meier curves showing disease-free survival and overall 
survival of Ccr2-knockout and wild-type LLC (top) and HNMOO7 (bottom) mice. 
Two-sided log-rank test. In a-f, two-sample, two-sided t-tests were used. Bars 
show mean¢ts.e.m. 


immunofluorescence staining until day 6 after resection (Fig. 1b). By con- 
trast, CD11b* cells—including CD11b*GRI* cells—are already substantially 
increased inthe lung on the day of resection (day O) (Fig. 1c). CD11b*GR1* 
cells collected from the lungs on day 3 after primary tumour resection 
(before visible metastases), inhibited both T cell proliferation and 
activation in vitro, which suggests that recruited CD11b*GRI1* cells in 
the lungs are functional MDSCs (Extended Data Fig. 3a). These MDSCs 
were the most-increased immune component of all CD11b* cells in 
tumour-bearing mice immediately before resection. Notably, even after 
resection, these MDSCs in the LLC and HNMOO7 models persisted until 
day 12 in the lung—but not in the liver—as the predominant immune 
cells, compared to non-tumour-bearing mice (Fig. Ic, Extended Data 
Fig. 3b, c). We then used pep-H6, a peptibody that selectively targets 
MDSCs and has a minimal effect on other immune components, to 
deplete MDSCs” (Fig. 1d). Peptibody-mediated depletion of MDSCs 
resulted in an increase in both disease-free survival and overall 
survival of the mice (Fig. 1d). By contrast, intravenous injection of 
MDSCs isolated from the lung of day-3 LLC mice induced metastases 
sooner and resulted in shorter overall survival times (Fig. le), indicating 
that these MDSCs had an important role in metastasis in this system. 
Our results indicate that the accumulation of MDSCs in the lungs pre- 
ceded metastasis development and delineated premetastatic niches. 
To test the differential role of monocytic (CD11b*Ly6C"#"Ly6G ) and 
polymorphonuclear (CD11b*Ly6C'Ly6G*) MDSCs in the formation 
of lung metastases, we transferred monocytic or polymorphonuclear 
MDSCs fromthe bone marrow of LLC mice. We found that—compared to 
vehicle—transferring monocytic MDSCs (but not polymorphonuclear 
MDSCs) resultedina higher rate of lung metastasis at day 3in LLC mice, 
which suggests that monocytic MDSCs have a predominant role in 
establishing the premetastatic microenvironment (Fig. le). 


Low-dose AET impedes migration of MDSCs 


Ithas recently been found that low doses of 5-azacytidine and entinostat 
are well-tolerated and clinically effective in heavily pretreated patients 
with NSCLC’®, and that these treatments target MDSCs!*”°. Low-dose 
5-azacytidine (100 nM) and entinostat (50 nM) in vitro had limited 
effect on the proliferation of LLC1, HNMOO7 and 4TI1 cells (Extended 
Data Fig. 4a). Similarly, these doses did not influence the viability or 
apoptosis of MDSCs sorted from the bone marrow of LLC mice 
and mice of the HNMOO7 model (hereafter HNMOO7 mice) (Extended 
Data Fig. 4b, c). We determined the in vivo doses of 5-azacytidine and 
entinostat (0.5 mg per kg body weight per day and 5 mg per kg body 
weight per day, respectively) that had no effect on primary tumour 
growth and that did not cause weight loss in immune-compromised 
mice with LLC and HNMOO7 tumours (Extended Data Fig. 4d-f). These 
doses also had a limited effect on the proliferation and apoptosis of 
CD45.I* donor cells in vivo (Extended Data Fig. 4g, h). Importantly, in 
our mouse models, these doses decreased MDSCs and niche-promoting 
molecules in the premetastatic lung (Fig. 1f, Extended Data Fig. 5a—-e). 
Moreover, the percentage of donor CD45.1* MDSCs in the lung is not 
affected by low-dose AET in the control sham-surgery mice (tumour- 
naive recipient mice) (Extended Data Fig. 5f). On the basis of these 
findings, we hypothesize that low-dose AET can inhibit the accumula- 
tion of MDSCs in the lung and prevent the formation of premetastatic 
niches in our models of pulmonary metastasis. 

When CD45.1* monocytic or polymorphonuclear MDSCs (5 x 10° cells 
each) were adoptively transferred on day 0 to CD45.2 mice, 36 h after 
transfusion CD45.1* cells decreased by 40-80% in the lungs of recipient 
mice treated with low-dose AET (Fig. 2a, Extended Data Fig. 6a). When 
thesame numbers of CD45.1° monocytic or polymorphonuclear MDSCs 
isolated from bone marrow of mock- (day 3) and low-dose AET-treated 
mice (day 3) were adoptively transferred into CD45.2 mice on day 0, 18h 
after transfusion there were significantly fewer CD45.1' cells from mice 
treated with low-dose AET than from vehicle-treated mice in the lungs 


of CD45.2' recipient mice, as expected (Fig. 2b, Extended Data Fig. 6b). 
These results demonstrate that low-dose AET impedes the migration 
of MDSCs to the premetastatic microenvironment in the LLC model. 
Together with our finding that only the transfusion of monocytic (and 
not of polymorphonuclear) MDSCs increases lung metastases (Fig. le), 
these results showed that—although low-dose AET impairs the migra- 
tion of both monocytic and polymorphonuclear MDSCs-—the role of 
low-dose AET in targeting the trafficking of monocytic MDSCs may 
be more important than its targeting of polymorphonuclear MDSCs 
in our LLC model. 

To identify differences in MDSCs from the lung and bone marrow 
of mock- and low-dose-AET-treated mice on day 3 after resection, 
we compared the gene expression of monocytic MDSCs (excluding 
differentiated MHC-II* and F4/80* macrophages) sorted from thesetwo 
groups in LLC mice (Fig. 2c). Gene-set enrichment analysis (GSEA) of 
monocytic MDSCs from the lung showed that low-dose AET induced a 
substantial change in gene sets associated with immune-cell chemot- 
axis and migration (Extended Data Fig. 6c). CCR2 expression in mono- 
cytic MDSCs from both the bone marrow and lung was significantly 
downregulated in the low-dose-AET group (Fig. 2d). Because CCR2 is 
a key regulator of monocytic cell migration from the bone marrowto 
thetumour microenvironment*”, these data suggest that low-dose AET 
may affect the trafficking of monocytic MDSCs to the premetastatic 
lung at least in part by downregulating CCR2. 

Quantitative PCR and flowcytometry confirmed that both messenger 
RNA (mRNA) and protein levels of CCR2 in monocytic MDSCs from bone 
marrow decreased after low-dose AET (Fig. 2e). Monocytic MDSCs from 
bone marrow collected on day 3 from LLC mice that were treated with 
low-dose AET show reduced migration in a transwell assay after induc- 
tion with CCL2 (Fig. 2f). Both the absolute number and percentage of 
monocytic MDSCs in the lung premetastatic microenvironment are 
negligible in Ccr2-knockout mice, and differed from wild-type mice 
(Fig. 2g). Compared to the wild-type C57BL/6 mice, Ccr2-knockout 
mice have a longer disease-free survival and overall survival both in 
the LLC and HNMO07 models (Fig. 2g). 

We next tested the mechanism of action of low-dose AET on CCR2 
expression in monocytic MDSCs from bone marrow. Database for 
Annotation, Visualization and Integrated Discovery (DAVID) path- 
way analysis reveals that the activity of the NF-KB signalling pathway 
was significantly downregulated in bone-marrow monocytic MDSCs 
from LLC mice that have been treated with low-dose AET (Extended 
Data Fig. 6d). In monocytic MDSCs from bone marrow, low-dose AET 
resulted ina highly significant reduction in RELB and p32 activation, 
compared to that found in mock-treated mice (Extended Data Fig. 6e). 
There was alimited effect on p50 and p65 activation. Furthermore, three 
days of treatment with BMS-345541 (a highly selective IKB kinase (IKK) 
allosteric site inhibitor) resulted in decreased expression of CCR2 in 
monocytic MDSCs from bone marrow in vivo (Extended Data Fig. 6f). 
Although we cannot rule out a direct effect of low-dose AET on CCR2 
expression (as well as other signalling pathways), our findings suggest 
that low-dose AET treatment may affect—at least in part—the expression 
of CCR2 in monocytic MDSCs from bone marrow via the modulation 
of the noncanonical NF-KB pathway”. 

CXCR2 and CXCR1 are known to be important for their role in traf- 
ficking polymorphonuclear MDSCs from the bone marrow to the 
tumour microenvironment”. We found that CXCR2 is downregu- 
lated in polymorphonuclear MDSCs from both the bone marrow and 
lung by low-dose AET in the LLC model (Extended Data Fig. 6g). The 
migration of polymorphonuclear MDSCs from the bone marrow of 
mice treated with low-dose AET is significantly decreased after induc- 
tion with CXCL1in a transwell migration assay (Extended Data Fig. 6h). 
Thus, low-dose AET may inhibit the trafficking of both monocytic 
and polymorphonuclear MDSCs from the bone marrow to the prem- 
etastatic microenvironment by downregulating CCR2 and CXCR2 
expression, respectively. 
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Fig. 3 |Low-dose AET skews monocytic MDSCs towards an interstitial 
macrophage-like population inthe lung premetastatic microenvironment. 
a, GSEA analysis revealed that macrophage or myeloid differentiation and 
activation gene sets were upregulated (left) in monocytic MDSCs fromthe 
lungs of LLC mice treated with low-dose AET. Representative upregulated 
GSEA plots with core-enriched genes (middle and right). NES, normalized 
enrichment score. Colour gradation is representative of log-transformed fold 
change over mock (n=3 biological replicates). Gene-set enrichment Pvalues, 
NES values and FDR values reported are calculated with 1,000 permutations in 
the GSEA software. FDR q< 0.25 was deemed significant. Gpr116is also known 
as Adgrf5.b, Significant changes of representative transcription factors (FDR- 
adjusted P< 0.05) associated with monocytic differentiation (n=3 biological 
replicates). c, FACS showing the differentiation of sorted monocytic MDSCs 

in vitro. Splenic monocytic MDSCs fromLLC mice were cultured for 3 days with 
tumour-conditioned medium (n=3 biological replicates). d, The top 200 


Skewing monocytic MDSC differentiation 

Further query of the GSEA-derived data indicated that gene sets related 
to macrophage or myeloid differentiation and activation were upregu- 
lated in monocytic MDSCs from the lungs of LLC mice treated with 
low-dose AET (Fig. 3a). Moreover, DAVID pathway analysis reveals that 
the NF-KB and PPAR signalling pathways were significantly downregu- 
lated and upregulated, respectively (Extended Data Fig. 7a). Consist- 
ent with differential regulation of these pathways occurring during 
the monocyte-to-macrophage differentiation”, there is a preferential 
increase in transcription factors associated with monocytic differen- 
tiation that are mainly macrophage-related (EGR1, EGR2, MAFB, MAF 
and PPARy)”>”, and not in those that are dendritic-cell-related (SPIB, 
RELB, STAT3 and FOXP1)°”’ (Fig. 3b). The related factors have also 
been validated in vitro and in vivo by quantitative PCR and western 


288 | Nature | Vol579 | 12 March 2020 


O10 104 108 

CD45.1 ——> 
significantly upregulated genes (FDR-adjusted P< 0.05) were mapped back to 
thereference ImmGen populations. The following populations from naive mice 
were used: lung CD103* dendritic cells (ImmGen code, DC_103+11b-_Lu); 
CD11b*CD24" lung dendritic cells (DC_103-11b+24+_Lu); lung interstitial 
macrophages (MF_l1c-11b+_Lu);lung alveolar macrophages (MF_Alv_Lu); and 
lung monocytes (Mo_Lu). The expressions of these 200 genes in 6 populations 
were transformed by zero-mean normalization. Two-sided Mann-Whitney U- 
test. n=3 biological replicates. e, FACS showing that low-dose AET canskewthe 
differentiation of transferred monocytic MDSCs towards aninterstitial 
macrophage-like phenotype in vivo. Purified 5 x 10° CD45.1‘ monocytic MDSCs 
from bone marrow (day 0) were adoptively transferred into CD45.2° recipient 
mice within 24 h of resection (n=3in each group) inthe LLC model. Mice 
received low-dose AET or vehicle treatment for 36 h.Inb, cand e, two-sample, 
two-sided ¢-tests were used. All bars show mean +s.e.m. 


blot analyses (Extended Data Fig. 7b, c). In vitro epigenetic treatment of 
splenic monocytic MDSCs from mice bearing LLC tumours decreased 
the percentage and absolute number of monocytic MDSCs significantly, 
whereas those of macrophages increased significantly (Fig. 3c)—imply- 
ing that low-dose AET might promote the differentiation of monocytic 
MDSCs into macrophages. 

Characterization using Immunological Genome Project (ImmGen) 
criteria further defined the resulting populations inthe lung premeta- 
static microenvironment”®. Our top 200 upregulated genes in mono- 
cytic MDSCs from the lungs of mice treated with low-dose AET map 
specifically to lung interstitial macrophages (Fig. 3d, Supplementary 
Table 2). Additionally, flow cytometry shows that lung interstitial mac- 
rophages (CD11b*CD11c°CD64"2"MHC-II"CD24-)” increase in mice 
treated with low-dose AET (Extended Data Fig. 8a, b). This observationis 
confirmed by adoptively transferring 5 x 10° CD45.1* monocytic MDSCs 
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Fig. 4| Low-dose AET inhibits pulmonary metastases and prolongs overall 
survival in mouse models, mainly by affecting MDSCs. a, Representative 

H &E-stained images of lung sections from LLC mice treated with low-dose AET 
or vehicle at different time points after surgery. Scale bars, 2mm. Graph shows 
the area and numbers of metastatic nodules. At each time point, three mice 
were killed for analysis. For each sample, sections from three levels were 
analysed. Tumour area was quantified using Aperio Imagescope software. Two- 
sample, two-sided ¢-test. Allbars are mean +s.e.m.b, Kaplan-Meier curves 
showing the disease-free survival and overall survival of LLC mice treated with 
low-dose AET or vehicle. c, Kaplan-Meier curves showing the disease-free 


into CD45.2* mice. As expected, 36 hafter transfer, the CD45.1° mono- 
cytic MDSCs differentiated towards a more-interstitial macrophage- 
like phenotype in the lungs of CD45.2 mice treated with low-dose AET 
(Fig. 3e, Extended Data Fig. 8c). In summary, low-dose AET may skew 
monocytic MDSCs towards an interstitial macrophage-like pheno- 
type in the lung premetastatic microenvironment. This acquisition 
of a MDSC-to-macrophage program is associated with CCR2 down 
regulation (a known consequence of monocyte-to-macrophage dif- 
ferentiation*°), thus also establishing the functional implications of 
our transcriptional data. Moreover, we found that transferring wild- 
type monocytic MDSCs rescued metastases in Ccr2-knockout mice 
and resulted in shorter overall survival times, whereas transferring 
wild-type polymorphonuclear MDSCs did not. By contrast, transfer- 
ring wild-type monocytic MDSCs from mice treated with low-dose 
AET brought only limited change to both the lung metastatic rate and 
overall survival time of Ccr2-knockout mice (Extended Data Fig. 8d, e). 
These findings imply that the decreased accumulation of monocytic 
MDSCs in the premetastatic niche owing to low-dose AET is mainly 
dependent on CCR2 signalling. 
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survival and overall survival of LLC mice treated with paclitaxel plus cisplatin 
chemotherapy, low-dose AET or vehicle. d, Kaplan-Meier curves showing the 
disease-free survival and overall survival of mice treated with vehicle, IgG 
(isotype control), anti-CD4/CD8 (T-cell-depletion antibody), low-dose AET and 
anti-CD4/CD8 in combination with low-dose AET in the LLC model. e, Kaplan- 
Meier curves showing the disease-free survival and overall survival of mice 
treated with vehicle, CCR2 antagonist (RS504393) (Sigma), low-dose AET and 
RS504393 in combination with low-dose AET in the LLC model. In b-e, two- 
sided log-rank tests were used. *P< 0.05, **P< 0.01, ***P< 0.001. 


Low-dose AET increases overall survival 


In our mouse models, low-dose AET reduces pulmonary metastases, 
as shown by H&E staining, gross pathological findings and cone-beam 
computed tomography imaging (Fig. 4a, Extended Data Fig. 9a-c).In 
all three mouse models, low-dose AET prolongs disease-free survival 
and overall survival time (Fig. 4b, Extended Data Fig. 9d). Compared to 
adjuvant chemotherapy, low-dose AET confers both alonger disease- 
free and overall survival in the LLC model (Fig. 4c). It has previously 
been found that epigenetic therapy results in a strong attraction of 
CD8* T cells to NSCLC tumours anda reversal of the immune-exhaustion 
profile of these cells’*. However, when we depleted CD4.and CD8T cells 
(from day 1 to day 6) in our LLC model, we observed that there was 
no difference during low-dose AET either in disease-free or overall 
survival—indicating that the reduction in metastases was independent 
of T cells (Fig. 4d, Extended Data Fig. 9e). Finally, the combination of 
low-dose AET and CCR2 antagonists in our LLC and HNMOO7 models 
shows synergy in disease-free and overall survival (Fig. 4e, Extended 
Data Fig. 9f, g). 
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Our observation that the lung premetastatic niche persists even after 
resection has translational implications. First, the results stress a poten- 
tial use of epigenetic treatment as an adjuvant therapy focused onthe 
perturbation of MDSCs. Second, our findings suggest that combining 
low-dose AET with CCR2 antagonists may be an emerging paradigm 
to prevent the accumulation of MDSCs in the premetastatic niche, 
thus inhibiting metastases and extending survival. Third, we provide 
compelling evidence that if monocytic MDSCs successfully migrate 
to the lung premetastatic niche, epigenetic modifiers can skew the 
population to amore-interstitial macrophage-like phenotype, antago- 
nizing their prometastatic functionality in that microenvironment 
(Extended Data Fig. 10). The post-resection recurrence of early-stage 
cancer (especially for the tumour types studied here) is a consider- 
able clinical challenge, and effective adjuvant therapies are lacking. 
Low-dose AET represents a potentially efficacious treatment to use 
in the absence of manifest primary tumour burden after resection. 
Our therapeutic paradigm may augment the efficacy of early cancer 
resection (which still represents the most effective treatment), while 
robustly inhibiting recurrence. We plan to translate these preclinical 
findings toa clinical trial in early-stage cancer using low-dose AET and 
CCR2 antagonists to prevent metastatic recurrence. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Cell lines and cell culture 

Lewis lung carcinoma cells (LLC1) and 4T1 cells were obtained from 
ATCC and cultured in RPMI 1640 medium (GIBCO) supplemented 
with 10% fetal bovine serum (FBS) (GIBCO). HNMO0O7, a p53-null 
mouse oesophageal squamous cell carcinoma cell line transformed 
by Hras°”", was provided by S. Singhal and cultured in Dulbecco’s modi- 
fied Eagle’s medium (DMEM) (GIBCO) containing FBS at 10% v/v. Cells 
were maintained in a humidified incubator at 37 °C in the presence of 
5% CO, and passaged every 2-3 days. Cell lines were routinely tested 
for mycoplasma and immediately tested upon suspicion. None of the 
celllines used in the reported experiments tested positive. 

LLC1 and HNMOO7 cells were kept in RPMI 1640 medium or DMEM 
with areduced (3%) serum concentration for 48 h. After that time, 
supernatants were collected, aliquoted and kept at -80 °C. For MDSC 
culture, tumour supernatant (3% FBS) containing 10 ng/ml GM-CSF 
(tumour-conditioned medium) was used. 


LLC tissue and HNMO07 tissue 
The LLC tissue (P3 working stock)*, labelled with a Pol2-Luc/GFP len- 
tiviral vector, was provided by G. Merlino. 

To generate HNMO07 tissues, mice were injected subcutaneouslyin 
the right flank with 1.0 x 10° viable HNMOO7 cells in 0.1 ml of PBS and 
Matrigel (1:1, v/v). On reaching 500-750 mm*, tumours were surgically 
removed, and lung metastases were monitored periodically by cone- 
beam computed tomography (CBCT) imaging. This transplantation 
was referred to as passage zero (PO). The lung nodules wereidentified 
by CBCT imaging, resected and then subcutaneously transplanted into 
other mice (passage one (P1)). The same procedure was repeated eight 
times (from P2 to P9), and the lung nodules were then collected and 
subcutaneously transplanted into 50 miceas passage ten (P10). When 
the P10 tumours reached 500-750 mm’, they were collected into 100 
tubes and frozen in liquid nitrogen as working stocks. All the HNMO0O7 
tumours in our studies were expanded from the P10 working stock. 


Mice 

Female C57BL/6 mice and BALB/c mice were purchased from Charles 
River Laboratories. Female mice congenic in mouse CD45 at the Ly5 
locus (B.6SJL-Ptprc* Pepc’/Boy) Ly5.1) were obtained from Jackson 
Laboratory. Female NOD/SCID/g-chain knockout (NSG) mice, bred and 
housed at the Johns Hopkins Animal Care Facility, were used. B6.129S4- 
Ccr2'™4/| mice were provided by S. A. McGrath-Morrow. All mice were 
maintained in pathogen-free conditions and used for experiments at 
age of 6-8 weeks. The Guide for the Care and Use of Laboratory Animals 
published by the National Institutes of Health (NIH publication 96-01, 
revised 1996) was followed, and all protocols were approved by the 
Johns Hopkins University Animal Care and Use Committee. 


Lung spontaneous metastasis model 

For preclinical studies, a vial of P3 LLC tumour was expanded subcuta- 
neously in 10 C57BL/6 mice (equals P4). The expanded tumours were 
resected at 500-750 mm? and transplanted subcutaneously into the 
required number of mice forthe actual drug study (P5). Inthe HNMOO7 
model, a vial of PIO HNMOO7 tumour was expanded subcutaneously 
in10 C57BL/6 mice (equals P11). The expanded tumours were resected 
at 500-750 mm? and transplanted subcutaneously into the required 
number of mice for the actual drug study (P12). For the 4T1 model, 
2 x 10° 4T1 cells were injected into the BALB/c mouse mammary fat 
pad using a tuberculin syringe. For all models of spontaneous lung 
metastasis, the primary tumours were surgically removed at 500-750 


mm, and the mice were randomized into treatment groups. For the 
sham surgery study, tumour-naive mice underwenta 2.0-cm skin inci- 
sion and closure in right flank. Mice were treated as follows: azaciti- 
dine 0.5 mg/kg (PBS vehicle) subcutaneously injected and daily for 14 
days. entinostat 5.0 mg/kg (LLC and HNMOO7 models) and 2.5mg/kg 
(4T1 model) (1% dimethyl sulfoxide (DMSO) in PBS vehicle) intraperito- 
neally injected daily for 14 days. In vivo, the CCR2 inhibitor, RS504393 
(Sigma) 2 mg/kg (PBS vehicle) or RS102895 (Sigma) 2 mg/kg (PBS vehi- 
cle) was injected intraperitoneally daily for 14 days. In vivo, the NF-KB 
inhibitor BMS-345541 (Selleck) 30 mg/kg/d or 75 mg/kg/d (3% Tween 
80 and sterile water) was orally administered daily for 3 days. The adju- 
vant chemotherapy regimen was paclitaxel 20 mg/kg/week injected 
intraperitoneally plus cisplatin 3 mg/kg, injected intraperitoneally 
twice a week for 2 weeks. The size of the subcutaneous tumours was 
measured manually and calculated by V(mm?) = 0.5 x L x W, in which 
Lislengthand Wis width, in millimetres. The metastasis or recurrence 
was monitored by CBCT imaging. The time from primary-tumour resec- 
tion to metastasis detection by CBCT imaging and to death was defined 
as the disease-free survival and overall survival period, respectively. 


Xenograft studies in NSG mice 

LLC tissue (P5) and HNMO07 tissue (P12) were transplanted subcutane- 
ously into the flanks of mice. Drug treatments were started 6-8 days 
after transplantation, when palpable tumours could be discerned. 
Treatment was continued for the entire duration of the study and mice 
were killed before tumour volumes exceeded 2,000 mm?. 


CD4* and CD8‘° T cell depletion experiment 

CD4 antibody (BioXcell, clone GK1.5) and CD8 antibody (BioXcell, clone 
116-13.1)-mediated CD4* and CD8* T cell depletion in LLC mice was 
initiated within 24 h of primary-tumour resection. Mice were injected 
intraperitoneally with 500 pg of CD4 and CD8 antibodies at day land 
day 4, respectively, after resection. As controls, mouse IgG2a isotype 
control (BioXcell, clone C1.18.4) and rat IgG2b isotype control (BioX- 
cell, clone LTF-2) were injected into the control mice. CD4* and CD8* 
T cell depletion was verified by FACS analysis of peripheral blood cells. 


MDSC depletion experiment 

Synthetic, complementary double-stranded oligonucleotides encoding 
H6 peptide (TIK), and an irrelevant control peptibody (pep-irrel) (D1) 
were provided by L. W. Kwak”. The recombinant peptibodies used in 
allin vivo studies were produced by Kempbio following an established 
protocol”. After LLC tumour resection, groups of C57BL/6 mice were 
injected via their tail veins with 50 pg of peptibody (TIK) once per day 
from day 1 to day 14. Control mice received pep-irrel or PBS. MDSC 
depletion was verified by assessment of the MDSC population in the 
premetastatic lung. 


CBCT guided systems 
To monitor the lung metastases, the mice were subjected to CBCT 
imaging (laboratory of K.K.-H.W.) at different time points after primary 
tumour resection. The standard procedure for CBCT imaging has previ- 
ously been described. 


Drug reagents 

Azacitidine (Sigma) was dissolved in distilled water at 500 pM (in vitro) 
and 5 mg/ml (in vivo). Entinostat (MS-275) provided by P. Ordentlich 
was dissolved in DMSO to concentrations of 500 pM (in vitro) and 1 
mg/ml (in vivo). Both azacitidine and entinostat were aliquoted and 
stored at—-80 °C and diluted to needed working concentrations before 
use. BMS-345541 (Selleck) was formulated as a2 mg/ml solution in 3% 
Tween 80, water and stored at 4 °C. RS504393 (Sigma) and RS102895 
(Sigma) were first dissolved in DMSO and stored at —80 °C. The stock 
solution was dissolved in normal saline at 2 mg/ml for intraperitoneal 
injection. All the stored reagents were for single use only. 
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Cell viability assays (CCK8 assay) 

Equal numbers of viable cells were plated in 96-well plates at the fol- 
lowing cell seeding densities per well: LLC1 cells (2 x 10°), HNMOO7 
cells (2 x 10°), 4T1 cells (2 x 10°), monocytic MDSCs (1 x 10*) and pol- 
ymorphonuclear MDSCs (1 x 10*). Cells were incubated with 100 ul 
drug-supplemented medium, treated with DMSO (vehicle) at 0.1% or 
the following drug concentrations standardized to 0.1% DMSO final 
concentration. For LLC1 cells, HNMOO7 cells and 4T1 cells, the treat- 
ment regimens were: azacitidine, 25 nM, 50 nM, 100 nM, 200 nM and 
400 nM; entinostat, 25 nM, 50 nM, 100 nM, 200 nMand 400 nM; com- 
bined treatment, entinostat 50 nM plus azacitidine 25 nM, 50 nM, 
100 nM, 200 nM and 400 nM. After incubation for 24 h, 48 hor 72h, 
cell viability was measured using a Cell Counting Kit-8 (CCK8) assay 
(Dojindo) according to the manufacturer’s instructions. The optical 
density at 450 nm (OD,,,,,,) Was measured using a multiwell plate reader 
(Micro-plate Reader; Bio-Rad). 


Apoptosis assay 

Apoptotic assays in cultured cells or transferred cells were performed 
using the Annexin V-FITC apoptosis detection kit (BD Pharmingen). 
Assays were performed according to the manufacturer’s protocol. 


T cell suppression assay 

MDSCs were isolated using the mouse MDSC isolation kit, according 
tothe manufacturer's protocol (Miltenyi Biotec), or by flowcytometry. 
CD8*T cells were isolated from the spleens of tumour-bearing mice 
by magnetic separation as previously described”, and then labelled 
with 5 uM carboxyfluorescein succinimidyl ester (CFSE) (Invitrogen). 
The CFSE-labelled T cells were incubated with CD11b*Ly6C'™Ly6G* or 
CD11b*Ly6C"®*Ly6G cells isolated from the lung premetastatic niche 
(day 3) at different ratios in a 96-well plate cultured with CD3/CD28 
beads (GIBCO) at 37 °C with 5% CO.,. After 72h, the cells and supernatant 
were collected. Cells were stained with anti-CD8 antibody so that CD8* 
Tcells could be specifically gated and examined, and the CFSE fluores- 
cence intensity of the CD8* T cells was determined by flowcytometry. 
IFNy concentrations in the supernatant were determined by enzyme- 
linking immunosorbent assay (ELISA) (R&D systems) according tothe 
manufacturer’s instructions. 


In vitro transwell migration assay 

Monocytic or polymorphonuclear MDSCs isolated from the bone mar- 
row by FACS were incubated in tumour-conditioned medium in the 
upper chamber of transwell inserts (5-um pore for monocytic MDSCs 
or 3-um pore for polymorphonuclear MDSCs) with CCL2 (5ng/ml or10 
ng/ml) or CXCL1 (20 ng/ml or SO ng/ml) inthe lower chamber. Transmi- 
grated monocytic or polymorphonuclear MDSCs were enumerated fol- 
lowing a 60-min or 120-min incubation. Fold changes were normalized 
tothe migration of the cells in the unstimulated mock group (set at 1). 


FACS profiling and sorting of immune cells 

FACS profiling and sorting of immune cells were performed after both 
lungs were collected from LLC mice, HNMOO7 mice and mice of the4T1 
model (hereafter 4T1 mice) in mock-treated or low-dose-AET-treated 
groups. Tissue was then digested for 30 min at 37 °C in digestion buffer 
(RPMI, FBS (5%), collagenase type 1 (Sigma-Aldrich, 0.2%), collagenase 
type 2 (Sigma-Aldrich, 0.2%) and DNase I (Roche, 50 U/ml)), minced and 
strained through a 40-ym cell strainer to obtain a single-cell suspen- 
sion", Perfused livers were cut into small fragments and incubated 
(37 °C, 250 rpm for 30 min) with 5 ml digestion buffer (5% FBS, 0.5 mg/ 
ml collagenase VIII (Sigma-Aldrich) and 0.1 mg/ml DNasel in PBS). This 
was followed by 3 cycles of washing with PBS at 400 rpm from whichthe 
supernatant was taken, omitting the parenchymal cell pellet. Spleens 
were mechanically dissociated, and red blood cells were lysed in 1x lys- 
ing buffer (BD Bioscience). Femurs and tibias from mice were dissected, 


and the bone marrow was flushed with RPMI 1640 medium. The cells 
were filtered through a 70-um cell strainer. Blood was obtained by tail 
vein puncture or by heart puncture after exposing the organ. Blood 
was collected into tubes containing 1.0 ml of PBS with 2 mM EDTA. 
Red blood cells were then lysed with lysing buffer and the cell pellet 
was washed twice in PBS. Cells were counted and then blocked with rat 
monoclonal anti-CD16/CD32 (Fc block antibody) in PBS for 30 min at 
4 °C. Cells werethen stained with antibodies. For intracellular antigens, 
cells were fixed and permeabilized in fixation/permeabilization buffer 
(eBioscience) for 30 min at 4 °C, washed and stained with intracellular 
antibodies for 30 min at 4 °C. Information about all the antibodies used 
is provided in Supplementary Table 3. 


Cell transfer experiments 

For adoptive transfer experiments, MDSCs were sorted fromthe bone 
marrow of CD45.1* LLC mice by FACS. A total of 5 x 10° monocytic or 
polymorphonuclear MDSCs were transferred into aCD45.2 mouse via 
tail-vein injection within 24 h of resection. 


Immunofluorescence analysis 

Immediately after perfusion, mice lungs were embedded in optimal 
cutting temperature compound, snap-frozen and stored at -80 °C until 
analysed. Five-micrometre cryosections were cut, air-dried, acetone- 
fixed and blocked with10% normal goat serum (30 min, Sigma-Aldrich). 
To detect MDSCs, slides were stained with rat anti-GR1 antibody (1:500, 
Biolegend). To detect T cells, slides were stained with rabbit anti-CD4 
antibody (1:200, Abcam) and rat anti-CD8a antibody (1:100, ebiosci- 
ence). To detect GFP signal, slides were stained with rabbit anti-GFP 
(1:200, Abcam). Sections were washed in PBS 3 times before adding 
goat anti-rabbit Alexa Fluor 488 and goat anti-rat TRITC secondary 
antibodies in blocking solution for 45 min at room temperature. All 
sections were mounted in ProLong Gold Antifade Mountant with 
DAPI (Thermo Fisher Scientific) and imaged with a Nikon Confocal 
Microscope C1 and analysed with EZ-C1 software (Nikon). 


Gene-expression analysis 

For genome-wide gene expression analysis, we used gene-expression 
arrays at the Sidney Kimmel Comprehensive Cancer Center Microar- 
ray Core at the Johns Hopkins University. MDSCs were isolated from 
bone marrow or lungs by FACS. RNA preparations were obtained from 
monocytic MDSCs from lung or bone marrow pooled from 6-8 mice 
per sample (low-dose AET group) or from 1-2 mice per sample (vehicle- 
treatment group). After total cellular RNA was extracted using the 
TRIzol method (Life Technologies), RNA concentration was determined 
using the NanoDrop machine and software (Thermo Fisher Scientific). 
Around 400 ng of total RNA was used to generate cDNA with the Quanti- 
Tect Reverse Transcription Kit (Quanta Biosciences). Transcriptomic 
profiles were obtained using Agilent 4 x 44K mouse Gene Expression 
v.2 arrays following the manufacturer’s instructions. Microarray data 
were analysed with the R package limmaas described inthe manual. In 
brief, background signals were corrected using the normexp method 
(with offset = 50). Then, normalization within arrays and between arrays 
was performed using the losses and Aquantile methods. The differential 
gene expression was defined as log,-transformed fold change > abso- 
lute (0.5) andanFDR adjusted P< 0.05. Ranked lists of log,-transformed 
fold change were analysed using GSEA by the Broad-Institute-developed 
data packages™. Significantly enriched gene sets were defined using an 
FDR cut-off of <0.25. P values were defined as <O.05 when comparing 
treated conditions versus controls. 


DAVID analysis of median absolute deviation derived genes 
Gene Ontology and Kyoto Encyclopedia of Genes and Genomes 
(KEGG) pathway enrichment analyses were conducted using the DAVID 
Bioinformatics resources database®**. Only categories that were below 
the FDR adjusted P value of 0.05 were considered. 


Quantitative real-time PCR 

Quantitative real-time PCR was performed with SYBR Green I detec- 
tion chemistry (Bio-Rad Laboratories) using the Applied Biosystems 
7500 Fast Real-Time PCR System and its software. B-Actin was used as 
areference gene. The specific primers used for quantitative real-time 
PCR are listed in the Supplementary Table 4. The AAC, method was 
used to calculate relative expression levels. 


Immunoblotting 

Protein was extracted by RIPA buffer containing protease and 
phosphatase inhibitors. Six-to-ten per cent Bis-Tris protein gels were 
equally loaded with 30 pg protein, electrophoresed at 110 V, and elec- 
trotransferred to PVDF membranes. Membranes were blocked with 
10% milk in TBST and immunoblotted with the following antibodies: 
rabbit polyclonal anti-MMP-9 (Abcam, 1:1,000), rabbit monoclonal 
anti-TGFB (Abcam, 1:1,000), rabbit polyclonal anti-ARG1 (Abcam, 
1:1,000), rabbit monoclonal anti-VEGF-A (Abcam, 1:1,000), rabbit 
monoclonal anti-S100A8 (Abcam, 1:1,000), rabbit monoclonal anti- 
TNF (Abcam, 1:1,000), rabbit monoclonal anti-IL-6 (Cell Signaling 
Technology, 1:1,000), rabbit polyclonal anti-DNMTI (Cell Signaling 
Technology, 1:1,000), rabbit monoclonal anti-EGR1(Abcam, 1:1,000), 
rabbit monoclonal anti-EGR2 (Abcam, 1:1,000), rabbit monoclonal 
anti-PPARy (Cell Signaling Technology, 1:1,000), mouse monoclonal 
anti-MAF-B (Santa Cruz Biotechnology, 1:100), rabbit monoclonal anti- 
p50 (Cell Signaling Technology, 1:1,000), rabbit monoclonal anti-p52 
(Cell Signaling Technology, 1:1,000), rabbit monoclonal anti-RELB 
(Cell Signaling Technology, 1:1,000), rabbit monoclonal anti-p65 (Cell 
Signaling Technology, 1:1,000), mouse monoclonal anti-B-actin (Sigma 
Aldrich, 1:10,000). The loading control antibodies (anti-B-actin) in 
all cases were applied. Information about all the antibodies used is 
provided in Supplementary Table 3. 


NF-kB DNA-binding capability assay 

To measure NF-kB activation, the TransAM NF-kB Family Transcription 
Factor Assay Kit (43296, Active Motif) was used according to manu- 
facturer’s protocol. In brief, bone-marrow-derived monocytic MDSCs 
fromLLC mice treated with vehicle or low-dose AET were isolated, and 
nuclear extracts were prepared in lysis buffer AM2. Nuclear lysates 
were incubated with oligonucleotides containing the NF-kB-binding 
consensus sequence, and specific antibodies were used to detect the 
different subunits within the bound complexes. Quantification was 
performed via colorimetric readout of absorbance at 450 nm. 


H&E staining and imaging 

For histological analysis, lungs were fixed in 10% formalin overnight, 
and subsequently transferred into 70% ethanol, embedded in par- 
affin according to standard protocols. Sections (5 pm) were stained 
with H&E and viewed under the Nikon Eclipse NiE microscope (Nikon 
Instruments). Images from the whole slide were acquired by Nikon Nis 
Element software. For each sample, sections from three levels were 
analysed. The number and area of metastatic sites were quantitated 
using Aperio Imagescope software. 


Adjuvant epigenetic treatment trial 

The J1037 (NCT01207726) study was a randomized phase-II study 
that compared the low-dose AET (5-azacytidine plus entinostat) with 
standard of care (observation) in patients with stage I (T1-2aNO) NSCLC 
after primary tumour resection. The J1037 study was performed in 
full accordance with the guidelines for Good Clinical Practice and the 
Declaration of Helsinki, and all patients gave written informed consent. 
Protocol approval was obtained from the Johns Hopkins Hospital and 
Anne Arundel Medical Center Ethics Committee. An independent data 
monitoring committee reviewed the safety data. The patients from 
the adjuvant epigenetic treatment group received the combination 


of azacitidine at 40 mg/m? on days 1-5 and 8-10 with entinostat at a 
7-mg fixed dose on days 3 and 10 of each 28-day cycle. The primary end 
point was the effect of 5-azacytidine plus entinostat on the hazard of 
3 years of progression-free survival in patients with resected stage I 
non-small-cell lung cancer. Finally, 13 patients were enrolled inthe trial. 
Owing to the difficulty in enrolling patients, the trial was prematurely 
terminated on 1 May 2015. 


Statistical analysis 

Flow and imaging data were collected using FlowJo Version 10.0.7, 
or Summit Version 5.4 (Beckman Coulter). All the experiments were 
performed in biological and technical triplicates. Values reported in fig- 
ures are expressed as the standard error of the mean, unless otherwise 
indicated. Depending on the type of experiment, data were tested using 
two-sample, two-sided t-test, two-sided log-rank test, one-way analysis 
of variance (ANOVA), hypergeometric test or Mann-Whitney U-test. 
Pvalues of < 0.05 were considered significant. *P < 0.05, **P< 0.01, 
**P< (0.001. Statistical analyses were performed with GraphPad Prism 
7.0 (GraphPad Software) or R version 3.6.1 (https://www.r-project.org). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Alldata generated are included in the Article andinits Supplementary 
Information. Source Data for Figs. 1-4, Extended Data Figs. 1, 3-9 are 
provided with the paper. Gene-expression data that support the find- 
ings of this study have been deposited in the Gene Expression Omnibus 
under accession number GSE124539. All data are also available from 
the corresponding authors on reasonable request. 
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Extended Data Fig. 1| Efficacy of low-dose AET on cancer recurrence NSCLC (NCTO1207726). b, Postsurgical recurrence rates inthe observation and 
in patients with stage (T1-2aNO) NSCLC ina phase-II clinical trial.a, Schema 


AET groups. c, Kaplan-Meier curves for disease-free survival in the observation 
for arandomized phase-II clinical trial of AET in patients with stage I (T1-2aNO) and AET groups. P=0.50 by two-sided log-rank test. 
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Extended Data Fig. 2 | Schema outlining the establishment and 
characteristics of the mouse models of pulmonary metastasis. a, Schema for 
establishing the highly aggressive HNMOO7 model. Pulmonary metastases 
were collected and serially subcutaneously implantedin the right flanks of 
mice for 10 passages. b, Schema for establishing the 4T1 model. 

c, Characteristics of mouse cell line (nonselective) and spontaneous mouse 
tissue models of pulmonary metastasis (metastases collected selectively from 
serial pulmonary metastases to produce asolely pulmonary-metastatic 
phenotype). d, Longitudinal gross pathological photographs of bilateral 


pulmonary metastases during the natural history of the LLC model in C57BL/6 
mice from day O to day 15 after surgery. N1, N2 and N3 depict the experiment 
performed in triplicate. Two mice were killed at each time point from day Oto 
day 15 (n=36); data for 18 mice are shown here as representative 
photomicrographs. e, H&E staining of pulmonary metastases in LLC, HNMOO7 
and 4T1 mice. The histology of LLC (day 9), HNMOO7 (day 12) and 4T1 (day 12) 
pulmonary metastases were confirmed by a pathologist. Scale bars, 100 pm. 
Representative data were repeated at least three times with similar results. 
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Extended Data Fig. 3 | CD11b*GRI‘ cells persist as the predominant immune 
cellseven after resectionin the lung premetastatic microenvironment as 
functional MDSCs.a, InLLC mice, lung CD11b*Ly6C"®"Ly6G" and 
CD11b*Ly6C'Ly6G* cells collected at 72 hafter resection both have 
suppressive activity in vitro against CD8a T cells. Freshly isolated 
CD11b*Ly6C"®"Ly6G" or CD11b*Ly6C'™ Ly6G* cells from both lungs at day 3 after 
resection were cocultured with CD8aT cells for 72 hat different ratios (0:1, 1:1, 
2:1, 4:1and 1:0). T cell proliferation and IFNy concentrations in the supernatant 
were measured by FACS (left) and ELISA (right), respectively (n=3 biological 
replicates). Representative data were repeated at least three times with similar 


results. Two-sample, two-sided t-test was used in the comparison with mock 
(CD8aT cells alone). b, Immune-cell profiles of liver in LLC mice. Single-cell 
suspensions from the entire liver were analysed by FACS (n=3 mice per time 
point) at different time points after surgery. NC, negative control (normal liver 
from C57BL/6 mice).c, Immune-cell profiles of both lungs in HNMOO7 miceat 
different time points after surgery. Single-cell suspensions from both lungs 
were analysed by FACS (n=3 mice per time point). NC, negative control (normal 
lungs from C57BL/6 mice). Inb, c, atwo-sample, two-sided t-test was usedin 
comparison with the negative control. All bars show mean + s.e.m.*P<0.05, 
*“P<0.01,***P<0.001. 
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Extended Data Fig. 4| Consideration of combined AET dosing on the basis of 
its effect on mouse models. a, Top, effect of different dosages of epigenetic 
modifiers on the viability of LLC1, HNMOO7 and 4T1 cells in vitro (72h, Cell 
Counting Kit-8). Graphs show the mean of 3 independent experiments; two- 
sample, two-sided t-tests compared with mock. Bottom, effect of low-dose 
5-azacytidine (100 nM) plus entinostat (50 nM) onthe proliferation of LLC1, 
HNMO07 and 4T1 cellsin vitro. A total of 1x 10° viable cells was plated per well. 
Cells were collected at 24, 48 and 72 hand counted using a cell counter (Bio- 
Rad) after Trypan blue exclusion. Graphs show the mean of 3 independent 
experiments; significance at 72 h was determined by one-way ANOVA followed 
by Tukey’s test for multiple comparisons. b, The effect of low-dose 
5-azacytidine (100 nM) plus entinostat (50 nM) onthe viability of MDSCs from 
bone marrow (day 3) of LLC mice (top) and HNMOO07 mice (bottom) in vitro 
(Cell Counting Kit-8). Graphs showthe mean of 3 independent experiments. 

c, The effect of low-dose 5-azacytidine (100 nM) plus entinostat (SO nM) onthe 
apoptosis of MDSCs from bone marrow (day 3) of LLC and HNMO07 mice 

in vitro. Cell apoptosis was measured by FACS at 48 h. The bottom right 
quadrant (annexin-V‘/7-AAD ) and top right quadrant (annexin-V‘/7-AAD‘*) 
represent early and late apoptoticcells, respectively. Graphs show the 
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percentage of total apoptosis (early and late apoptosis) in mock and treatment 
groups (n=3 biological replicates). d, Tumour growth and body weight of NSG 
mice bearing LLC tissue that were treated with different doses of entinostat 
plus 5-azacytidine. Significance at day 12 (top) and day 14 (bottom) was 
determined by one-way ANOVA followed by Tukey’s test for multiple 
comparisons.e, Summary table of tumour growth, body weight and treatment- 
related death of NSG mice bearing LLC tissue. Regimens in red indicate dosages 
withno effect on tumour growth, weight loss or treatment-related death. 

f, Tumour growth and body weight of NSG mice bearing HNMO07 tissue that 
were treated with 5-azacytidine at 0.5mg kg ‘d“ plus entinostat at 5 mg kg*d™ 
or vehicle. Significance at day 14 was determined by two-sample, two-sided t- 
test. g,h, The effect of low-dose AET onthe proliferation (g) and apoptosis (h) 
of donor-derived CD45.1° MDSCs from bone marrow inCD45.2 LLC mice. 
Proliferation and apoptosis of immature (MHC-II) and mature (MHC-II’) 
CD45.1' cellswere measured by FACS at 36 h after transfusion (day 2). Graphsin 
g show the percentage of Ki67* cells (n=3 mice per group). Graphs in hshow the 
percentage of total apoptosis (early and late apoptosis) in mock-treated and 
low-dose-AET groups (n=3 mice per group). Inb,c, g,h, two-sample, two-sided 
t-tests were used. All bars show mean +s.e.m. *P<0.05. 
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Extended Data Fig. 5 | Low-dose AET disrupts the lung premetastatic 
microenvironment, mainly by affecting MDSCs. a, The effect of low-dose AET 
(5-azacytidine 0.5mgkg™d“™ plus entinostat 2.5mgkg‘d“) on MDSCs fromthe 
lung at day 3 after resection in 4T1 mice (n=3 mice per group).b,c, 
Immunofluorescence staining of CD4* and CD8' T cells (b) or GR1' cells (c) from 
the lung premetastatic microenvironment (day 3) in LLC mice with or without 
low-dose AET. Negative control was normal lungs from tumour-free C57BL/6 
mice. Immunofluorescence staining was performed using CD4 (green) and 
CD8 (red) antibodies, or GR1 (red) antibodies. Merged images contain DAPI 
staining for cell nuclei (blue). Original magnification 20x. Representative data 
were repeated at least three times with similar results. d,e, The mRNA (d) and 
protein (e) levels of representative molecular factors known to promote 
premetastatic microenvironment formation from bothlungs of normal mice, 
and mock- or low-dose-AET-treated LLC mice (day 3) were measured by 
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quantitative PCR and western blot. Two-sample, two-sided ¢-test for 
quantitative PCR experiments (n=3 biological replicates). For gel source data, 
see Supplementary Fig. 1. All the experiments were performedin triplicate and 
similar results were obtained. f, Top, graph showing the percentages of donor- 
derived cell subsets (CD45.1° MDSC cells) in the lungs of LLC mice or of sham- 
surgery mice (tumour-naive recipient mice) 36h after surgery. Bottom, graph 
showing the percentages of donor-derived cell subsets (CD45.1° MDSC cells) in 
the lungs of low-dose-AET- or vehicle-treated sham-surgery mice (tumour- 
naive recipient mice) 36 h after surgery. Purified 5 x 10° MDSCs from bone 
marrow of CD45.1mice bearing LLC tumours (day 0) were adoptively 
transferred into CD45.2 recipient mice in the sham-surgery tumour-naive 
model or LLC model (n=3 mice per group). Ina, f, two-sample, two-sided t-tests 
were used. All bars show mean +s.e.m. *P< 0.05, **P< 0.01, ***P< 0.001. 
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Extended Data Fig. 6 | Low-dose AET induces substantial changesin 
immune-cell chemotaxis and migrationin MDSCsin LLC mice. a, Left, 
schema showing the effect of low-dose AET onmonocytic or 
polymorphonuclear MDSCs transferred from CD45.1 to CD45.2 C57BL/6 mice 
inthe LLC model. CD45.1' cells (transferred polymorphonuclear MDSCs) were 
identified in the lungs of the recipient mice and analysed by flowcytometry 
(right, n=3 mice per group). b, Left, schema showing trafficking ability of 
adoptively transferred monocytic or polymorphonuclear MDSCs from low- 
dose-AET-treated or untreated CD45.1 mice in the LLC model. CD45.1' cells 
(transferred polymorphonuclear MDSCs) were identified in the lungs of the 
recipient CD45.2 mice and analysed by flowcytometry at 18 h after transfer 
(right, n=3 mice per group).c, Left, top 10 upregulated gene sets from GSEA of 
lung monocytic MDSCs after 72 h of treatment with low-dose AET. Middle and 
right, representative upregulated GSEA plots of immune-cell chemotaxis and 
migration. Colour gradation is representative of log,-transformed fold change 
over mock (n=3 biologically replicates). Gene-set enrichment P values, NES 
values and FDR values reported are calculated with 1,000 permutations inthe 
GSEA software. FDR q< 0.25 was deemed significant. d, DAVID analyses of 
significantly downregulated genes using the KEGG Gene Ontology in MDSCs 
from bone marrow of LLC mice, treated or untreated with low-dose AET. Top 20 
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downregulated pathwaysare presented (n=3 biologically replicates). 
Hypergeometric test (FDR-adjusted P< 0.05). e, Low-dose AET significantly 
decreases the nuclear activation of p52 and RELB (OD 450 nm (mock versus low- 
dose AET), p52: 0.95 + 0.035 versus 0.721+ 0.011, P= 0.0034; RELB, 

0.251+ 0.012 versus 0.1+ 0.003, P=0.0002), but not p50 and p65 in monocytic 
MDSCs from bone marrow of LLC mice in vivo. Nuclear lysates were incubated 
with oligonucleotides containing the NF-xkB-binding consensus sequence, and 
specific antibodies were used to detect the different subunits within the bound 
complexes (n=3 biological replicates). f, FACS shows the effect of 30 mgkg‘d* 
and 75 mgkg ‘dof BMS-345541 (a highly selective IKB kinase inhibitor) on 
CCR2 expression in monocytic MDSCs from bone marrow of LLC mice on day 3 
after surgery. The experiments were performedin triplicate, and similar results 
were obtained. g, CXCRland CXCR2 expression of polymorphonuclear MDSCs 
collected on day 3 from the bone marrow or lung detected by quantitative PCR 
(top) and FACS (bottom) in LLC mice treated with vehicle or with 72 h of low- 
dose AET (n=3 biological replicates). h, Transwell migration assay of sorted 
polymorphonuclear MDSCs from bone marrow of low-dose-AET- (72 h) or 
vehicle-treated LLC mice, induced by CXCL1 (20 ng mI and50ng mI") for 

120 min (n=3 biological replicates). In a,b, e, g,h, two-sample, two-sided t-tests 
were used. All barsshow mean +s.e.m. 
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Extended Data Fig. 7 | Low-dose AET promotes the differentiation of 
monocytic MDSCs towards macrophages in the LLC model. a, DAVID analyses 
of the significantly downregulated and upregulated genes using the KEGG 
Gene Ontology inLLC mice, treated or untreated with low-dose AET 

(n=3 biological replicates). Hypergeometric test (FDR-adjusted P< 0.05). 

b,c, The mRNA (b) and protein (c) levels of representative transcription factors 
were measured by quantitative PCR and western blot, respectively. In vitro, 
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splenic monocytic MDSCs from LLC mice were cultured for 3 days with tumour- 
conditioned medium. In vivo, monocytic MDSCs from both lungs of mock- 
(day 3) and low-dose-AET-treated LLC mice (day 3) were sorted for analysis. For 
gel source data, see Supplementary Fig. 1. Representative data were repeated 


at least three times with similar results. In b, atwo-sample two-sided t-test was 
used, n=3 biological replicates. Allbars show mean+s.e.m. 
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Extended Data Fig. 8| Low-dose AET promotes the differentiation of 
monocytic MDSCs towards an interstitial macrophage-like population in 
the lung premetastatic microenvironment. a, Gating strategy used to 
identify and analyse lung interstitial macrophages inthe lung premetastatic 
microenvironment by FACS. b, The effect of low-dose AET on lung interstitial 
macrophages from LLC mice. The percentage and cell counts of interstitial 
macrophages from both lungs in mock- and low-dose-AET-treated mice were 
analysed by FACSat day 3 after surgery (n=3 mice per group). Two-sample, two- 
sided t-test. All bars show mean +s.e.m.c, Gating strategy used to identify and 
analyse CD45.1' lung interstitial macrophages from the lungs of recipient 


modifiers treated PMN-MDSCs (n=7) modifiers treated PMN-MDSCs (n=7) 


CD45.2 mice after the transfusion of CD45.1' monocytic MDSCs. d, Kaplan- 
Meier curves showing the disease-free survival and overall survival of Ccr2- 
knockout LLC mice after transfusion of wild-type monocytic MDSCs (5 x 10°), 
low-dose-AET-treated (in vivo) wild-type monocytic MDSCs (5 x 10°) or vehicle 
at day land day 4, respectively. e, Kaplan-Meier curves showing the disease- 
free survival and overall survival of the Ccr2-knockout LLC mice after 
transfusion of wild-type polymorphonuclear MDSCs (5 x 10°), low-dose-AET- 
treated (in vivo) wild-type polymorphonuclear MDSCs (5 x 10°) or vehicle at 
day land day 4, respectively. Ind, e, two-sided log-rank tests were used. 
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Extended Data Fig. 9 | Low-dose AET inhibits pulmonary metastases and 
prolongs overall survival in mouse models. a, Representative photographs 
showing lungs treated with vehicle or low-dose AET in LLC (day 6) and HNMOO7 
(day 10) mice. The red arrows indicate the metastases. b, Representative CBCT 
images of lung metastases on day 6 after resection in LLC mice treated with 
vehicle or low-dose AET. The red arrows indicate the metastases.c, 
Representative H&E-stained images of lung sections from HNMOO7 (top) and 
4T1 (bottom) mice treated with low-dose AET or vehicle at different time points 
after surgery. Scale bars, 2mm. Graph shows area and numbers of metastatic 
nodules. At each time point, three mice were killed for analysis. For each 
sample, sections from three levels were analysed. Two-sample, two-sided ¢- 
test. d, Kaplan-Meier curves showing the disease-free survival and overall 
survival of HNMOO7 and 4T1 mice, treated with low-dose AET (for the 4T1 


model, 5-azacytidine 0.5 mg kg‘ d™ plus entinostat 2.5mgkg‘d‘) or vehicle 
after surgery.e, FACS showing the effect of T-cell-depleting antibodies on CD4* 
and CD8'T cellsin the peripheral blood of LLC mice. n=3 mice per group. Two- 
sample, two-sided ¢-test. f, Kaplan-Meier curves showing the disease-free 
survival and overall survival of LLC mice treated with vehicle, CCR2 antagonist 
(RS102895) (Sigma), low-dose AET and RS102895 plus low-dose AET after 
surgery. g, Kaplan-Meier curves showing the disease-free survival and overall 
survival of HNMOO7 mice treated with vehicle, CCR2 antagonist (RS504393) 
(Sigma), low-dose AET and RS504393 in combination with low-dose AET. Ind, f, 
g, two-sided log-rank tests were used. Representative data ina, b were repeated 
at least three times with similar results. All bars show mean +s.e.m. *P<0.05, 
*“P<0.01,***P<0.001. 
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Extended Data Fig. 10 | Representation of the effect of low-dose adjuvant 
epigenetic modifiers on lung metastases in the tumour pulmonary- 
metastasis models. Graphic model showing the inhibition of pulmonary 
metastases by low-dose AET via its effect on MDSCs. First, low-dose AET can 
inhibit the trafficking of monocytic and polymorphonuclear MDSCs from the 
bone marrowtothe premetastatic microenvironment by downregulating the 
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expression of CCR2 and CXCR2, respectively. Second, even if MDSCs migrate 
tothe lung, low-dose AET can skew the differentiation of monocytic MDSCs 
towards an interstitial macrophage-like phenotype in the lung premetastatic 
microenvironment. Therefore, low-dose AET can disrupt the lung 
premetastatic microenvironment, ultimately inhibiting pulmonary 
metastases. 


Malcolm V. Brock, Lin Shen, Stephen B. Baylin, 
j | a Te {¢ S¢ ar Corresponding author(s): Franck Housseau 


Last updated by author(s): Dec 3, 2019 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 
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Policy information about availability of computer code 


Data collection 1.FACSAria Il SORP cell sorter, LSRII, FACS Diva(BD Biosciences); MoFlo XDP Cell Sorter (BECKMAN COULTER) 
2.Cone beam computed tomography (CBCT) guided systems(Department of Radiation Oncology and Molecular Radiation Sciences, Johns 
Hopkins University) 
3.Nikon Eclipse NiE microscope (Nikon Instruments Inc.) 
4.NanoDrop One(Thermo Scientific) 
5.Bio-Rad CFX96 Real-Time System and Bio-Rad CFX Manager v2.1 (BIO-RAD) 
6.Bio-Rad TC20 Automated Cell Counter(BIO-RAD) 


Data analysis GraphPad Prism version 7.0, FlowJo software version 10.0.7, Photoshop(Adobe),Aperio Imagescope software version 12.4, R version 
3.6.1, EZ-C1 software version 3.90, Broad Institute GSEA 3.0, DAVID version 6.8, Summit 5.4. 
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All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


All data generated are included in the published Letter and in its Supplementary information. Gene-expression data that support the findings of this study have 
been deposited in the Gene Expression Omnibus under accession numbers GSE124539. All data are also available from the corresponding author on reasonable 
request. 


=) 
je’) 
a 
Ss 
= 
o 
= 
o 
Nn 
© 
je’) 
= 
2) 
=r 
= 
a 
se) 
O 
= 
= 
=) 
a 
Wn 
iS 
= 
> 
je’) 
< 


J2G012Q 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No statistical methods were used to predetermine sample sizes. Sample sizes were selected empirically from previous experimental 
experience with similar assays, and/or from sizes generally employed in the field. 


=) 
je’) 
a 
Ss 
= 
o 
= 
o 
Nn 
© 
ced) 
= 
@) 
=r 
= 
O 
Ze) 
fo) 
= 
= 
=) 
a 
Wn 
iS 
= 
> 
red) 
< 


Data exclusions No exclusion of data was made. 


Replication Experiments were performed at least 3 times and/or with sufficient cells/animals per group to demonstrate statistical significance. All 
attempts at replication were successful. 


Randomization _ This is applicable to the in vivo animal experiments: Mice were divided randomly into cages before tumor resection and treated with 
corresponding drugs postresection. 


Blinding The performer of the animal experiment was blinded to the randomization process. Pathological examination was done in a blinded fashion. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Antibodies were used for flow cytometry, immunoblotting, immunofluorescence and ELISA. 
- Flow cytometry was conducted using fluorochrome-conjugated antibodies. 
- For staining of mouse cells we used: anti-CD45(Biolegend, 103116),anti-CD11b(BD biosciences, 563168),anti-Gr-1(BD 
biosciences, 553127),anti-CD3(BD biosciences, 552774),anti-CD4(BD biosciences, 562891),anti-CD8a(Biolegend, 100711), anti- 
CD25(BD biosciences, 552880),anti-Foxp3(BD biosciences, 560414),anti-CD45.1(Biolegend, 110708), anti-CD192(Biolegend, 
150610), anti-Ly6C(Biolegend, 128008), anti-Ly6G(Biolegend, 127606),anti-I-A/I-E(BD biosciences, 562564),anti-CD11c(BD 
biosciences, 550261),anti-CD24(BD biosciences, 564237),anti-CD64(BD biosciences, 740622),anti-CD16/32(Biolegend, 
101320), anti-F4/80(Biolegend, 123145),anti-CXCR1(BD biosciences, 566383),anti-CXCR2(Biolegend, 149609), anti- 
Ki67(Biolegend, 652409). All antibodies were used at 1:100 dilution for flow cytometry staining. 
-Immunoblot: anti- MMP9(abcam, ab38898),anti-TGF-beta(abcam, ab179695),anti-ARG1(abcam, ab91279),anti-VEGF-A(abcam, 
ab214424),anti-S100A8(abcam, ab92331),anti-TNF-alpha(abcam, ab183218), anti-IL-6(Cell Signaling Technology, 12912), anti- 
EGR1(Abcam, ab133695),anti-EGR2(Abcam, ab108399),anti-PPAR-gamma(Cell Signaling Technology, 2435),anti-Maf-b(Santa 
Cruz Biotechnology, sc-376387),anti-p50(Cell Signaling Technology, 55764), anti-p52(Cell Signaling Technology, 4882),anti-Rel- 
B(Cell Signaling Technology, 55764),anti-p65(Cell Signaling Technology, 55764),anti-beta-actin(Sigma, A5441),anti-DNMT1(Cell 
Signaling Technology, 5032),and anti-rabbit IgG, HRP-linked antibody and anti-mouse IgG, HRP-linked antibody(Cell Signaling 
Technology, 7074, 7076). All primary antibodies were used at 1:1000 dilution for immunoblotting, except anti-Maf-b was used at 
1:100 and anti-beta-actin was used at 1:10000. Secondary antibodies were used at a 1:2000 dilution. 
- Immunofluorescence analysis: Anti-Gr-1 antibody (Biolegend, 108401, 1:500),anti-GFP antibody(Abcam, ab183734, 1:200), 
anti-CD4 antibody (Abcam, ab183685, 1:200), anti-CD8a antibody (ebioscience, 14-0808-80, 1:100),and goat anti-rabbit and anti- 
rat secondary antibodies(Invitrogen, A27034, A18876). 
- IFN-gamma ELISA kits: Mouse IFN-gamma (DY485) from R&D. 
For in vivo CD4/CD8 depletion we used: anti-mouse CD4 antibody(BEO003-1) , anti-mouse CD8a antibody(BE0118) and isotype 
purchased from Bioxcell. 
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Validation All antibodies are commercially available and validated by the manufactures for the applications and species used in this study. 
See manufactures websites for validation statements (www.abcam.com; www.cellsignal.com; https://www.bdbiosciences.com/ 
en-us; https://www.biolegend.com; https://bxcell.com; https://www.thermofisher.com; https://www.sigmaaldrich.com; https:// 
www.rndsystems.com) 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) Lewis lung carcinoma cells (LLC1) and 4T1 cells were obtained from ATCC.HNMOO7, a p53-null murine esophageal squamous 
cell carcinoma cell line transformed by HRASG12V, was kindly provided by Prof. S Singhal (University of Pennsylvania). 


Authentication No method authentication was performed for the mouse cell lines used. 


Mycoplasma contamination Cell lines were routinely tested for mycoplasma and immediately tested upon suspicion. None of the lines used in the 
reported experiments tested positive. 


Commonly misidentified lines None were misidentified. 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals 6-8 week-old female C57BL/6 mice and BALB/c mice were purchased from Charles River Laboratories (Wilmington, MA). 6-8 
week-old female NOD/SCID/g-chain knockout (NSG) mice, bred and housed at the Johns Hopkins Animal Care Facility, were used. 
6-8 week-old female B6.129S4-Ccr2tm1Ifc/J mice were kindly provided by Prof. Sharon A. McGrath-Morrow (Johns Hopkins 
University). 6-8 week-old female mice congenic in murine CD45 at the Ly5 locus (B.6SJL-Ptprca Pepcb/BoyJ Ly5.1) were obtained 
from Jackson Laboratory. 


Wild animals No wild animals were used in the study. 
Field-collected samples No field-collected samples were used in the study. 
Ethics oversight The Guide for the Care and Use of Laboratory Animals published by the National Institutes of Health (NIH publication 96-01, 


revised 1996) were followed, and all protocols were approved by the Johns Hopkins University Animal Care and Use Committee. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics 13 patients(4 female, 9 male) with stage | non-small cell lung cancer and an Eastern Cooperative Oncology Group (ECOG) 
performance status of 0-1 after curative surgery. 


Recruitment Patients with stage | NSCLC who presented at John Hopkins for medical care and fulfilled the eligibility criteria as per approved 
protocol were offered participation in the study. We are unaware of any potential self-selection bias or other biases present. 


Ethics oversight Human biological samples were sourced ethically and their research use was in accord with the terms of the informed consent 
provided under the institutional protocols approved either by Johns Hopkins Hospital Ethics Committee. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Clinical data 


Policy information about clinical studies 
All manuscripts should comply with the ICMJE guidelines for publication of clinical research and a completed CONSORT checklist must be included with all submissions. 


Clinical trial registration NCT01207726 


Study protocol Study protocol is accessible on Clinical Trials website. More information is available from the corresponding author upon 
reasonable request. 


Data collection The J1037 (NCTO01207726) study was a randomized phase || study that compared the low dose adjuvant epigenetic therapy (Aza 
plus entinostat) with standard of care (observation) in patients with stage | (T1-2aNO) NSCLC after primary tumour resection in 
Johns Hopkins Hospital and Anne Anmdel Medical Center. The patients from the adjuvant epigenetic treatment group received 
the combination of azacitidine at 40 mg/m2 on days 1-5 and 8-10 with entinostat at a 7 mg fixed dose on days 3 and 10 of each 
28 days cycle. The primary end point was the effect of Aza plus entinostat on the hazard of 3 years progression-free survival in 
patients with resected stage | non-small cell lung cancer. Finally, thirteen patients were enrolled in the trial. Due to the difficulty 
in enrolling patients, the trial was prematurely terminated on May 1, 2015. 
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Outcomes Primary Outcome Measures : 
1. Disease-free Survival (DFS) [ Time Frame: 3 years ] 
The DFS hazard rate and 95% confidence interval will be reported. At this time, event time distributions for disease-free survival 
in the two arms will be estimated with the method of Kaplan and Meier and compared using a stratified Cox-proportional 
hazards model (stratified for stage IA vs IB) with a two-sided alpha of 10%. 
Secondary Outcome Measures : 
1. Factors That Predict Clinical Outcome in Patients Treated With Combination Epigenetic Therapy in Terms of Epigenomic Data 
Generated From the Illumina Platform [ Time Frame: Up to 2 years ] 
The study was terminated early due to poor accrual since the requirement of clinic administration of the 5AZA daily and post- 
Operative patients not wanting 6 months of treatment. For this reason, 13 pts were enrolled and data was not analyzed, for 
which we are unable to make any conclusions or report results. 
2. Median Disease-free Survival [ Time Frame: Up to 5 years ] 
Determined by the method determined by Kaplan and Meier. Estimated with 95% confidence intervals. Cox proportional hazard 
modeling will be used for multivariate analysis. 
3. Number of Relapses and Deaths Per Total Time of Follow-up Comparing Patients With N2 Lymph Nodes in Terms of 
ethylated and Unmethylated [ Time Frame: Up to 5 years ] 
aplan Meier curves will be used. 
4. Overall Survival [ Time Frame: Up to 5 years ] 
Determined by the method determined by Kaplan and Meier. Estimated with 95% confidence intervals. Cox proportional hazard 
modeling will be used for multivariate analysis. 
5. Presence of Methylation Patterns [ Time Frame: Up to 2 years ] 
cNemar's test will be used to compare the change in methylation after treatment in sputum. 
6. Toxicities Graded According to the National Cancer Institute Common Terminology Criteria for Adverse Events (NCI CTCAE) 
v4.0 [ Time Frame: Up to 5 years ] 
Simple descriptive statistics will be utilized to display the data. 
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Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a ‘group’ is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 
Sample preparation See methods 
nstrument Cell were recorded on an LSR II Flow Cytometer (BD Biosciences), or on a MoFlo XDP Cell Sorter (BECKMAN COULTER) 
Software Data were analyzed using FlowJo v10.0.0 software, or Summit 5.4 software. 


Cell population abundance __ For sorting experiments, the cell were subsequently analyzed by FACS and >98% purity was achieved. 


Gating strategy Following routine gating strategy. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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The DNA-dependent protein kinase (DNA-PK), which comprises the KU heterodimer 
and a catalytic subunit (DNA-PKcs), is a classical non-homologous end-joining (CNHE)) 


factor’. KU binds to DNA ends, initiates CNHEJ, and recruits and activates DNA-PKcs. 
KUalso binds to RNA, but the relevance of this interaction in mammals is unclear. Here 
we use mouse models to show that DNA-PK has an unexpected role in the biogenesis 
of ribosomal RNA (rRNA) and in haematopoiesis. The expression of kinase-dead DNA- 
PKcs abrogates cNHE)”. However, most mice that both expressed kinase-dead DNA- 
PKcs and lacked the tumour suppressor TP53 developed myeloid disease, whereas all 
other previously characterized mice deficient in both cNHEJ and TP53 expression 
succumbed to pro-B cell lymphoma’. DNA-PK autophosphorylates DNA-PKcs, which 
is its best characterized substrate. Blocking the phosphorylation of DNA-PKcs at the 
T2609 cluster, but not the S2056 cluster, led to KU-dependent defects in 18S rRNA 
processing, compromised global protein synthesis in haematopoietic cells and 
caused bone marrow failure in mice. KU drives the assembly of DNA-PKcs ona wide 
range of cellular RNAs, including the U3 small nucleolar RNA, whichis essential for 
processing of 18S rRNA‘. U3 activates purified DNA-PK and triggers phosphorylation 
of DNA-PKcs at T2609. DNA-PK, but not other cNHE) factors, resides in nucleoliin an 
rRNA-dependent manner and is co-purified with the small subunit processome. 
Together our data show that DNA-PK has RNA-dependent, cNHEJ-independent 
functions during ribosome biogenesis that require the kinase activity of DNA-PKcs 
and its phosphorylation at the T2609 cluster. 


The cNHEJ pathway directly ligates DNA double-strand breaks (DSBs) 
and is required for V(D)J recombination in developing lymphocytes’. 
The KU70-KU86 (Ku80 in mice) heterodimer (KU) binds to DNA ends, 
promotes ligation by the DNALIG4-XRCC4-XLF complex’, and recruits 
and activates DNA-PKcs, which activates the Artemis endonuclease 
for end-processing'. DNA-PKcs is required for end-processing, but not 
blunt-end ligation® ®. Productive V(D)J recombination requires both 
end-processing and end-ligation®. Correspondingly, DNA-PKcs-null 
mice lack mature lymphocytes, but are otherwise normal®®. 

To understand how DNA-PK activity regulates DNA-PKcs, the best 
characterized substrate of DNA-PK, we generated DNA-PKcs” mice? 
(also known as Prkdc®””), DNA-PKcs*° mice died embryonically with 
TP53-dependent neuronal apoptosis’, similarto end-ligation-deficient 
Xrcc4" or Lig4’ mice, indicating that, once recruited to DNAends, 
DNA-PKcs requires its kinase activity to license end-ligation. Ina TP53- 
deficient background, cNHEJ-deficient mice (such as Xrcc4”" mice) 


succumb to pro-B cell lymphomas by the age of 100 days®*" (Fig. 1a—c). 
Unexpectedly, most DNA-PKcs Tp53” mice (Tp53 is also known as 
Trp53) died by 40 days without lymphomas (fewer than 25% survived 
for more than 80 days and developed pro-B cell lymphomas; Fig. 1la—c, 
Extended Data Fig. 1a). Pro-B cell lymphomas from Xrcc4’ Tp53” mice 
carry an alternative end-joining (alt-EJ)-mediated immunoglobulin /gh/ 
Myc translocation and co-amplification’, leading to overexpression 
of MYC. Alt-E) is not affected in DNA-PKcs" 7p537 B cells”, so lack 
of translocation cannot explain the low incidence of lymphomas in 
DNA-PKcs®“T p53” mice. Moreover, the early death of DNA-PKcs’ 
*2Tp537 mice is not dueto incomplete rescue of the lethal DNA-PKcs*”’ 
* phenotype by lack of TP53, as most DNA-PKcs" Tp53” mice sur- 
vived to adulthood before succumbing to thymic lymphomas and/ 
or sarcomas due to 7p53 deficiency” (Fig. la-c). Bone marrow and 
spleen samples from young DNA-PKcs’ T p53” mice revealed amye- 
loid expansion, accumulation of hypolobated and hyperchromatic 
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Fig. 1| Kinase dead or phosphorylation deficient DNA-PKcs causes 
haematopoietic defects independent of cNHEJ. a, Kaplan-Meier survival 
curve of DNA-PKcs°"*“Tp537 and control (DNA-PKcs”"°Tp53, DNA- 

PKcs’ Tp537 ,Xrcc4’ Tp537 and Tp53”) mice. Log-rank (Mantel-Cox) test; 
***P< 0.001, NS (not significant) P> 0.05. b, Cause of death for tumour cohorts 
by genotype.c, The life expectancy for mice of different genotypes plotted by 
tumour type. Pro-B cell lymphomas in DNA-PKcs®©”Tp53~ mice have similiar 
latency as in DNA-PKcs’ Tp53” and Xrcc4” Tp53” mice (two-sided unpaired 
Student’s t-test; ***P< 0.001, NS P> 0.05). Data shown as mean +S.e.m. 

d, Representative histological analyses of bone marrowand spleen from 
3-week-old DNA-PKcs" Tp53” mice. The samples were stained for the myeloid 
marker myeloperoxidase before being counterstained with haematoxylin (for 
nuclei). Arrows indicate megakaryocytes. Scale bars, 100 ppm. e, Representative 
histology analyses of bone marrow from 2-week-old DNA-PKcs“ and DNA- 
PKcs*“ mice with or without 7p53 deficiency. Scale bars, 100 um. Exact 
Pvalues and defined samplesizes (n) are provided inSupplementary Datal. 


megakaryocytes, and occasionally c-KIT-positive myeloid blasts 
(Fig. 1d, Extended Data Fig. 1b, c), characteristic of myelodysplastic 
syndrome (MDS). MDS does not manifest in other cNHEJ- and TP53- 
deficient mice, suggesting that kinase-dead DNA-PKcs might have a 
role in haematopoiesis beyond cNHE)J. 


DNA-PKcs mutations affect haematopoiesis 


Toinvestigatewhether loss of autophosphorylation of DNA-PKcsexplains 
the haematopoietic defects, we generated mice with alanine substitu- 
tions to either of thetwo autophosphorylation clusters»—the T2609 
cluster (DNA-PKcs**) or the $2056 cluster (DNA-PKcs’2*)!® (Extended 
Data Fig. 2a-d). DNA-PKcs??? mice were healthy with normal 
red blood cell (RBC) and platelet counts (Extended Data Fig. 2e, f). 
By contrast, alanine substitutions at all five (DNA-PKcs™”™) or three out 
of five (DNA-PKcs*“)” threonines in the T2609 cluster caused TP53- 
dependent bone marrow failure (Fig. le) and death by 4 weeks of age 
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(Fig. 2a) with severe anaemia, pale organs and thrombocytopaenia 
(Fig. 2b, Extended Data Fig. 2g-i). Moreover, haematopoietic stem 
and progenitor cells (HSPCs; Lin SCAI‘c-KIT*) and their downstream 
progenitors (Lin SCAT c-KIT*) decreased by around 1,000-fold in DNA- 
PKcs4 mice (Fig. 2c, d, Extended Data Fig. 3a, b). Fetal liver HSPC 
counts from DNA-PKcs“ and DNA-PKcs*”® mice also decreased by 
half (Extended Data Fig. 3c, d), suggesting that these mice had an early 
haematopoietic defect. Thus, both the kinase activity of DNA-PKcs and 
autophosphorylation at the T2609 cluster are necessary for haemat- 
opoiesis in mice expressing DNA-PKcs. 


Effect of DNA-PK mutations on DNArepair 


Next, we investigated whether the anaemia in DNA-PKcs“ mice 
was triggered by defects in known functions of DNA-PKcs. DNA- 
PKces**4Tp53” mice were viable” (Fig. 2a) and had wild-type levels of 
myeloid cells, neutrophils, erythrocytes and platelets, and notably 
had a substantial number of mature B and T lymphocytes (Extended 
Data Fig. 4a-e). By contrast, DNA-PKcs’ Tp53’ and DNA-PKcsk” 
*®Tp537 mice? lacked mature B and T lymphocytes (Extended Data 
Figs. la, 4a, b), suggesting that DNA-PKcs™ supports lymphocyte 
development. Accordingly, V(D)J recombination occurred efficiently 
in DNA-PKcs” (Extended Data Fig. 4f-h) and DNA-PKcs“ B cells". 
T cell development was also unaffected in DNA-PKcs“ Tp53*” mice 
(Extended Data Fig. 4i). Moreover, deletion of KU, which abrogates 
DNA-PK assembly and cNHE], rescued lethality (Fig. 2a) and restored 
HSPCs, RBCs and neutrophils, but not lymphocytes, in DNA-PKcs“ 
mice (Fig. 2b-d, Extended Data Fig. 4d, e), suggesting that DNA-PKcs 
has cNHEJ-independent functions during haematopoiesis that require 
phosphorylation of the T2609 cluster. Accordingly, DNA-PKcs*4 
mouse embryonic stem cells (ES cells) and fibroblasts (MEFs) were 
not markedly hypersensitive to DNA-damaging agents such as mito- 
mycin C (MMC) or ionizing radiation (IR)°”? (Extended Data Fig. 5a-e). 
DNA-PKcs“ MEFs were more resistant to MMC than MEFs from mice 
deficient in the Fanconi anaemia (FA) pathway (Fancd2”; Extended 
Data Fig. 5b). Unlike humans with FANCD2 deficiency, Fancd2’ mice 
donot develop spontaneous lethal anaemia”. These findings suggest 
that defects inthe FA pathway cannot explain the lethal bone marrow 
failure of DNA-PKcs mice. DNA-PKcs MEFs proliferated well, with 
normal S-phase frequency (Extended Data Fig. 5f, g), in contrast to 
the marked proliferation defects of CNHEJ-deficient MEFs’. On the 
other hand, DNA-PKcs“ ES cells had reduced S-phase and formed 
smaller colonies (Extended Data Fig. 6a—c), whereas cNHE)-deficient 
ES cells proliferate well®’°. CRISPR-mediated deletion of Ku8O rescued 
the colony-formation defects of DNA-PKcs“ ES cells (Extended Data 
Fig. 6a, b). Telomere stability?” was comparable between DNA-PKcs“ 
and DNA-PKcs“ cells (Extended Data Fig. 6d-f). These findings and 
the fact that haematopoiesis is unaffected inage-matched DNA-PKcs” 
mice (Fig. 2b, Extended Data Fig. 2g), suggest that neither telomere 
dysfunction nor cNHE) deficiency alone can explain the lethal anaemia 
in DNA-PKcs*™ mice. 


DNA-PK mutations compromise translation 

In addition to DNA, purified KU also binds to RNA~. Yeast Ku binds 
to the telomerase RNA component (TERC)”*™, but the RNA part- 
ners of mammalian KU and the relevance of KU-RNA interactions 
in mammalian cells remain unclear. Without DNA damage, KU and 
DNA-PKcs, but not other cNHE) factors, reside in the nucleolus in 
a detergent-resistant and Pol I transcription-sensitive manner” 
(Extended Data Fig. 8a—c), suggesting that DNA-PK has an RNA- 
dependent role in the nucleolus. Defects in ribosome biogenesis 
can trigger TP53-dependent haematopoietic failure’, and protein 
synthesis is tightly regulated in HSPCs””. Defects in protein synthe- 
sis, as exemplified in Diamond-Blackfan anaemia (DBA)”°, cause 
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Fig. 2 | Mutations in DNA-PKcs cause KU-dependent haematopoietic failure 
and translation defects. a, Kaplan-Meier survival curve of DNA-PKcs“, DNA- 
PKces*““Tp53”, DNA-PKcs“ Tp537, DNA-PKcs“ Ku70“ and Ku70~ mice. 

b, Ku70 deficiency rescued the peripheral RBC counts of 2-week-old DNA- 
PKcs*“““Ku707 mice.c,d, Representative flowcytometry analyses (c) and the 
absolute number of haematopoietic stem and progenitor cells (Lin SCAI°c- 
KIT’) (d) from 2-week-old DNA-PKcs“ and DNA-PKcs““"4Ku70~ mice. e, MCV of 
RBCs from 2-week-old DNA-PKcs and DNA-PKcs” Ku707 mice. 

f, Representative flowcytometry analyses of bone marrow RBCs from 2-week- 
old DNA-PKcs“ and DNA-PKcs“ Ku707 mice. CD71 and TER119 staining 
determines the stage of RBC differentiation. S1, CD71‘ TER119"™; 

$2, CD71‘ TER119™4; $3, CD71°TER119" 2"; $4, CD71™“TER119"2"; 

S5, CD71 TER119"®". Forward scatter of TER119* RBCs is shown (bottom). 

g, Relative frequencies of S2,S3 and S4 among all immature erythroblasts 


TP53-dependent macrocytic anaemia with increased mean corpus- 
cular volume (MCV, RBC size). DNA-PKcs“ mice but not DNA-PKcs” 
mice displayed TP53-depedent macrocytic anaemia (30% increase in 
MCV; Fig. 2b, e, Extended Data Figs. 4c, 6g, 7a). In DNA-PKcs“ mice 
the percentage of CD71*Ter119"#" (S3) erythroblasts was lower in fetal 
liver at embryonic day 14.5 (E14.5) and in bone marrow at postnatal 
day 14 (P14) than in wild-type mice (Fig. 2f, g, Extended Data Fig. 7b, 
c). Notably, CD71‘ erythroblasts (S1-S3) have the highest transla- 
tion levels among all the haematopoietic cell-types examined (Fig 
3f, Extended Data Figs. 3e, 7d), as measured by the incorporation 
of O-propargyl-puromycin (OP-Puro) into nascent proteins””. An 
OP-Puro'™ population accumulated throughout erythroblast devel- 
opment in the P14 bone marrow (Fig. 2h, i) and fetal liver (Extended 
Data Fig. 7e, f) of DNA-PKcs“ mice, suggesting that there were 
global translation defects. DNA-PKcs“ Tp53”” erythroblasts also 
displayed consistent translation defects (Extended Data Fig. 7g), 
suggesting that DNA-PKcs has a TP53-independent role in protein 
synthesis. Likewise, TP53-deficient, v-ABL kinase-transformed pro-B 
cell lines”? derived from DNA-PKcs” or DNA-PKcs*“4 mice showed 
reduced translation (Extended Data Fig. 7h). Chemical inhibition of 
DNA-PKcs (with NU7441), but not of the related ataxia telangiectasia- 
mutated (ATM) kinase (with KU55933), reduced global translation 
(Fig. 2j, Extended Data Fig. 7i). DNA-PKcs’ and Ku707" (also known 
as Xrcc6) B cells had moderate but consistent translation defects that 
were insensitive to inhibition of DNA-PK (Fig. 2j). Erythrocyte counts 
and translation of $3 erythroblasts in young DNA-PKcs’ or Ku70” 
mice were comparable to those in wild-type mice, and significantly 
(P< 0.05) higher than in DNA-PKcs“ mice (Fig. 2b, f-i, Extended Data 
Fig. 6g, h). These results reveal a critical role for DNA-PK kinase and 
phosphorylation of the T2609 cluster during translation in HSPCs 
in the presence of KU and DNA-PKcs. 


(S1-S4). The S5 percentage is much higher in DNA-PKcs“ mice (Extended Data 
Fig. 7c).h, Measurements of translation in S1, S2 and S3 erythroblasts from P14 
DNA-PKcs“ and DNA-PKcs“ mice bone marrow. i, Quantification of the 
frequency of OP-Puro'™ among S1, S2 and $3 erythroblasts by OP-Puro labeling 
of bone marrow cells for 1h. j, Inhibition of DNA-PKcs with NU7441 leads toa 
DNA-PKcs- and Ku-dependent reductionin global translation. Graphs show 
fold change in OP-Puro fluorescence (mean +s.d.) normalized to DNA-PKcs‘* 
cells (set to 100%). Wild-type with or without NU7441 (44% reduction with 
NU7441); DNA-PKcs” with or without NU7441 (-1% reduction with NU7441), and 
Ku707 with or without NU7441 (-10% reduction with NU7441). 

b,d,e,g,i, Mean+s.e.m.a, Two-sided log-rank (Mantel-Cox) test, ***P< 0.001. 
j, Two-sided paired Student’s ¢-test; all other panels, two-sided unpaired 
Student’s t-test, ***P< 0.001; **P<0.01;*P< 0.05; NS, P> 0.05. Exact Pvalues 
and defined sample sizes (n) are provided in Supplementary Data1. 


DNA-PK binds to RNA 


The biological importance of KU in the nucleolus is unknown. We 
hypothesized that defects in ribosome biogenesis might underlie the 
translation defects in DNA-PKcs" and DNA-PKcs HSPCs. Indeed, 
blocking the activity of RNA polymerase | (Pol I) with actinomycin D 
(ActD) depleted KU and DNA-PKcs from the nucleolus in human and 
mouse cells (Extended Data Fig. 8a—c), suggesting that DNA-PKcs and 
KU reside in the nucleolus in an rRNA-dependent manner, potentially 
as part of pre-rRNA ribonucleoprotein complexes. Thus, we isolated 
the small subunit (SSU) processome via the U3 small nucleolar RNA 
(snoRNA; herein U3). U3 coordinates splicing of the 5’-external tran- 
scribed spacer (5’-ETS) and thus maturation of the 40S ribosomal 
subunit®°. We identified U3 binding proteins using comprehensive 
identification of RNA-binding proteins by mass spectrometry (ChIRP- 
MS)*! in two different cell types (Fig. 3a, b, Extended Data Fig. 8d, e). 
ChIRP-MS recovered all known and conserved components of the 
eukaryotic SSU processome*” and the DNA-PK holoenzyme, but not 
other cNHE)J factors (Fig. 3a, b, Supplementary Table 1). Comparative 
analyses of the U3 ChIRP-MS and KU86 immunoprecipitation (IP)-MS® 
confirmed that KU86 associates with SSU components (Extended Data 
Fig. 9a). If the assembly of DNA-PK*® or DNA-PK* affects rRNA process- 
ing, unprocessed rRNA intermediates might accumulate in DNA-PK 
mutant cells. Northern blots from DNA-PKcs", DNA-PKcs” and DNA- 
PKcs*“*4 cellsrevealed partial accumulation of the 21S and 12S pre-rRNA 
precursors of the 18S and 5.8S rRNAs, respectively (Fig. 3c, d, Extended 
Data Fig. 9b). Given that DNA-PKcs mutant cells were viable, we expected 
the rRNA processing defects to be less severe than and/or different 
from those resulting from SSU ablation. Notably, deletion of Ku70 
along with mutations in DNA-PKcs rescued the rRNA processing defects 
(Fig. 3c, d, Extended Data Fig. 9b). The 21S pre-rRNA intermediate also 
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Fig. 3 |DNA-PK, but not other cNHE}J factors, co-purifies with the U3 snoRNA 
and regulatesrRNA processing. a, A protein-protein interaction (grey lines) 
network of U3 ChIRP-MS from HeLa cells of nucleolar and ribosomal biogenesis 
factors. Colours map to the enrichment log,[ChIRP/control] value of each 
protein. Known complexes are physically grouped and labelled. Complete hit 
list detailed in Supplementary Table1.b, Cellular compartment term analysis 
of the U3 ChIRP-MS from HeLa cells using the DAVID tool. False discovery rate 


accumulated in CD34 bone marrow cells from patients with DBA**. 
Thus, the KU-mediated assembly of a catalytically inactive or T2609 
phosphorylation-defective DNA-PK leads to rRNA processing defects. 

To identify the spectrum of cellular RNAs that are bound by KU and 
DNA-PKcs, we performed infrared crosslinking and immunoprecipi- 
tation® (irCLIP) of KU86 and DNA-PKcs (Extended Data Fig. 10a, b). 
Annotation of the reverse transcriptase (RT) stops across the transcrip- 
tome revealed the interaction profile of KU86 and DNA-PKcs (Extended 
Data Fig.10c, e). Although most KU86 and DNA-PKcs RT stops mapped 
to introns of mRNA or non-coding RNAs (Extended Data Fig. 10c-f), 
they showed little overlap in this category (Extended Data Fig. 10e, f). 
TheRT stops shared by KU86 and DNA-PKcs included known DNA-PK- 
interacting RNAs, suchas Terc and Neat1* (Extended Data Fig. 10g, h). 
We also observed binding to non-coding RNAs with essential roles in 
ribosome biogenesis, including the 5’-ETS and U3 (Fig. 4a, b). Binding 
of DNA-PK to the 5’-ETS was sensitive to ActD (Extended Data Fig. 10i,j), 
which might contribute to the Pol I-dependent localization of DNA-PKcs 
and KUin the nucleolus (Extended Data Fig. 8a-c). 


The U3 snoRNA can activate DNA-PK 


DNA-PKcs and KU86 RT stops were biased towards the 5’ end of U3, 
near theconserved hinge region (Fig. 4b) that is essential for 18S rRNA 
maturation**. By contrast, DNA-PKcs and KU86 showed little prefer- 
ence towards the snoRNP domain of U3 (Fig. 4b). Correspondingly, 
DNA-PKcs and KU86 bound few and dissimilar snoRNAs, when com- 
pared tothesnoRNA-binding protein DDX21” (Extended Data Fig. 11a, 
b). To understand whether KU86 drives the assembly of DNA-PK at U3, 
we mapped the DNA-PKcs and KU86 peaks to an in silico secondary 
structure (Fig. 4c) of U3**. The DNA-PKcs and KU86 crosslinking sites 
are adjacent to one another in the same predicted stem-loop of U3 
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(FDR) was estimated (Benjamini corrected P) by comparing the 483 enriched 
proteins tothe human proteome; the top ten most enriched terms are shown. 
c,d, Northern blot analyses of 18S and 5.8S rRNA maturation in v-ABL kinase 
transformed B cells from noted genotypes. The probe covers the 3’ edge of 18S 
rRNA and 5.8S rRNA, respectively (blue line). The normalized relative 
intensities of 21S/34S, 18S/21S and 12S/32S are marked below the gels. n=4 (c), 
n=3(d) biologically independent experiments. 


(U3-SLI1; Fig. 4c), and within the DNase I footprint of DNA-PK’. KU86 
crosslinked mainly tothe terminal hairpins (Fig. 4c). KU can assemble 
DNA-PK at DNA fragments with hairpin ends. Incubation of U3-SL1 
with the native DNA-PK complex, purified from human cells, showed 
that KU bound toa substantial fraction of U3-SL1 (Extended Data 
Fig. 11c). Anti-KU86 antibodies supershifted the KU-U3-SL1 complex 
(Extended Data Fig. 11d). A mutated U3-SL1 hairpin failed to compete 
with wild-type U3-SL1 for KU86 binding (Extended Data Fig. Ile), sug- 
gesting that RNA structure is important for binding of KU. At higher 
concentrations of DNA-PK, KU was able to drive assembly of DNA-PK 
on U3-SL1 in the absence of ATP (Extended Data Fig. 11c). Moreover, 
U3-SL1 drove auto-phosphorylation of DNA-PK at the T2609 cluster 
(Extended Data Fig. 11f), but no efficient phosphorylation was detected 
at the T2056 cluster (Extended Data Fig. 11g). This RNA-based activity 
depended on DNA-PKcs kinase, as NU7441 or non-hydrolysable ATP 
blocked T2609 phosphorylation (Extended Data Fig. 11h-j). Similarly, 
U3-SL1 promoted baculovirus-purified human DNA-PK to undergo 
auto-phosphorylation at T2609, but not $2056 (Fig. 4d, Extended Data 
Fig. 11k). U3-SL1 activated DNA-PK to phosphorylate serine 15 on TP53 
peptides, with efficiency comparable to the basal activity of isolated 
ATM (Fig. 4e). Notably, DNA-PK assembly and auto-phosphorylation 
at U3-SL1is less efficient than with dsDNA (Extended Data Fig. 11f). 
We speculated that additional factors might be available to facilitate 
efficient DNA-PK assembly and activation by RNA. 


RNA-dependent function of DNA-PK 


Despite the accumulation of KU and DNA-PKcs in the nucleolus, young 
Ku” or DNA-PKcs” mice do not have deleterious translation defects. 
Using DNA-PKcs” and DNA-PKcs“ mouse models, we have identi- 
fied an unexpected role for DNA-PK during ribosome biogenesis and 
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Fig. 4|U3snoRNA drives assembly, activation, and auto-phosphorylation of 
DNA-PK at the T2609 cluster. a, DNA-PKcs and KU86RT stops that map to the 
four pre-rRNA introns; 5S’ETS, ITS1, ITS2, and 3’ETS. irCLIP was performed in 
biological duplicate. b, Transcript normalized histogram of DNA-PKcs (orange) 
and KU86 (blue) irCLIP RT stops from DMSO-treated HeLa cells mapping to U3. 
c, Insilico (mFold)-predicted secondary structure of U3. The top three and two 
peaks of DNA-PKcs and KU86, respectively, are annotated onthe secondary 
structure. The length of the 5’end stem-loop is shown. d, Baculovirus-purified 
human DNA-PK in vitro kinase phosphorylation assay in the presence of 


demonstrated how an endogenous RNA in mammalian cells drives 
the assembly and activation of DNA-PK. The different effects of DNA- 
PKcs’ and DNA-PKcs on ribosome biogenesis mirror their effects 
on cNHE], where end-ligation is abrogated in DNA-PKcs” cells but 
not in DNA-PKcs~ cells; this suggests that, once recruited to DNA or 
RNA, the kinase activity of DNA-PKcs might allosterically regulate the 
accessibility of DNA-PK-bound DNA or RNA to regulate repair or pro- 
cessing. Moreover, U3 triggers DNA-PKcs phosphorylation at T2609, 
which is critical for rRNA processing, but relatively dispensable for 
cNHEJ, implying that RNA- versus DNA-bound DNA-PK might undergo 
different conformational changes and are subject to different regula- 
tion. In this context, we propose that the inability of DNA-PK to regu- 
late itself when assembled at U3 and/or other structured RNAs blocks 
SSU assembly and pre-rRNA processing, leading to translation defects 
that preferentially affect cell types and tissues with high demand for 
protein synthesis (for example, haematopoietic cells and ES cells). 
Although we focused on U3, our irCLIP analysis uncovered a myriad 
of other RNAs that could assemble DNA-PK, which will significantly 
expand the functional domain of DNA-PK. Finally, we noticed thatina 
7p53-deficient background, DNA-PKcs” and DNA-PKcs“ mice have 
different overall survival, which might be due to the differing effects 
of these mutations on specific RNAs or the added cNHE) defects in 
DNA-PKcs mice. Further investigation will establish the relevance 
of DNA-PK assembly at RNA and howit differs from DNA, and whether 
they could bothcollaborate to promote DNA repair and RNA process- 
ing at highly transcribed regions“. 
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increasing amounts of U3-SL1. Western blots were performed with antibodies 
against DNA-PKcs phosphorylated at the T2609 cluster (top), total DNA-PK 
(middle), and KU86 (bottom). e, Baculovirus-purified human DNA-PK in vitro 
kinase phosphorylation assay against purified human TP53. Activated DNA-PK 
phosphorylates TP53 on serine 15. Anin vitro ATM kinase assay was used asa 
control. Western blots were performed with antibodies against TP53 
phosphorylated on serine 15 (top), total TP53 (middle) and KU86 (bottom). 
d,e,n=3 biologically independent experiments. 
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Methods 


Mouse models 

DNA-PKcs“, Xrcc4”, DNA-PKcs", DNA-PKcs”, Ku70”, and Tp53*” 
alleles and mice have been described?*"**" *, The DNA-PKcs PQRallele 
substitutes five serine residues within the S2056 cluster (S2053 in 
mouse) with alanine (Extended Data Fig. 2a). The DNA-PKcs-5A muta- 
tion convertsall five threonines in the T2609 cluster of exon 58 of DNA- 
PKcs to alanine (Extended Data Fig. 2b-d). The targeting arms (5’ and 
3’ arms) were generated via PCR, cloned into pGEMT and sequence 
verified, before being subcloned into the pEMC-neo targeting vector. 
The -500-bp fragment containing all the mutations was synthesized 
(Genewiz) and integrated into the 3’ arm (4.4 kb) to generate the tar- 
geting vector (Extended Data Fig. 2b, c) with inverted neo-resistance 
(NeoR) cassettes flanked by a pair of FRT sites. An Smal-linearized 
targeting plasmid was electroporated into mouse ES cells (129/Sv 
background) and NeoR-resistant clones were isolated and screened 
via Southern blotting with Spel digestion and a 5’ probe (amplified 
via primers 5’-TCATTAAGGCCTCGCAATC-3’ and 5’-TGTTGGAGA 
AGGATGGGAAA-3’; germline, 14.7 kb; targeted, 5.8 kb). The success- 
fully targeted clones were validated via Southern blotting with aNeoR 
probe for single integration and sequenced to verify the desired muta- 
tion, before being injected for germline transmission. The resultant 
DNA-PKcs**4" chimeras (N for neo positive) were crossed with consti- 
tutively FLIPase-expressing Rosa26a™""” mice (Jax stock number: 
003946, also in 129/Sv background) to remove the NeoR cassette and 
allowthe expression of DNA-PKcs-5A. Genotyping was performed with 
primers (5’-GCAGATCTCTGTGAATT TATGACC-3’ and 5’-CCAAGAAAAA 
CTAACGAAAAATGC-3’). The product corresponding to the wild-type 
allele is 172 bp and the product corresponding to the SA allele is 244 
bp (including an FRT site). Complete blood counts were measured by 
the Institute of Comparative Medicine at Columbia University using 
Genesis (Oxford Science Inc.) and histological analyses were performed 
by the histopathological core of Herbert Irving Comprehensive Cancer 
Center (HICCC) at Columbia University. All animal work was conducted 
in an on-site, pathogen-free facility in Columbia University and fol- 
lowing procedures pre-approved by the Institutional Animal Care and 
Use Committee (IACUC) at Columbia University Medical Center. No 
statistical methods were usedto predetermine sample size. The experi- 
ments were not randomized andthe investigators were not blinded to 
allocation during experiments and outcome assessment. 


Flowcytometry analyses of haematopoietic cells and 
lymphocytes 

Flowcytometry analyses of haematopoietic cells and lymphocytes were 
performed using previously published methods”*** with minimal col- 
our modifications. In brief, single-cell suspensions of bone marrow or 
fetal liver cells were filtered through a 70-um nylon cell strainer (Falcon, 
352350). Splenocytes were treated with red blood cell lysis buffer before 
staining. For HSPC analysis, bone marrow or fetal liver cells were first 
stained witha cocktail of biotin labelled primary antibodies containing 
mouse specific lineage markers: anti-CD4 (Biolegend, 100404), anti- 
CD8a (Biolegend, 100704), anti-CD3e (Biolegend, 100304), anti-CDS 
(BD Bioscience, 553018), anti-B220 (Biolegend, 103204), anti-Ly6G/ 
Ly6C (Gr-1) (Biolegend, 108404) and anti-TER119 (Biolegend, 116204) at 
4°C for1h, before being washed and stained witha cocktail of directly 
conjugated antibodies including FITC anti-mouse CD34 (eBioscience, 
11-0341-82), PE anti-mouse Ly-6A/E (Sca-1) (Biolegend, 108108), PE/ 
Cy7 anti-mouse CD16/32 (Biolegend, 101318), APC anti-mouse CD117 
(c-Kit) (135108), APC/Cy7 Streptavidin (Biolegend, 405208) and Brilliant 
Violet 510 anti-mouse CD41 (Biolegend, 133923) at 4°C for 1.5 h. For 
lymphocyte and peripheral haematopoietic cells analyses, cells from 
the bone marrow, spleen, or thymus were stained with B cell cocktail 
(FITC anti-mouse CD43, Biolegend, 553270; PE goat anti-mouse IgM, 
Southern Biotech, 1020-09; PE-cyanineS5 anti-Hu/Mo CD45R (B220), 


eBioScience15-0452-83; and APC anti-mouse TER1J9, Biolegend 116212); 
T cell cocktail (PE rat anti-mouse CD4, Biolegend 557308; FITC anti- 
mouse CD8a, Biolegend 100706; PE/CyS anti-mouse CD3e, eBiosci- 
ence 15-0031-83; and APC anti-mouse TCRB, BD Pharmingen 553174); 
myeloid cocktail (FITC anti-mouse CD11b, BD Pharmingen 553310; 
PE rat anti-mouse CD19, BD Pharmingen 557399; PE/Cy5 anti-mouse 
CD3e, eBioscience 15-0031-83; and APC anti-mouse Ly6G/Ly6C(Gr-1), 
Biolegend 108412) and/or erythroid cocktail (PE-cyaninesS anti-Hu/ 
Mo CD45R(B220) (as above); PE/Cy7 anti-mouse TER119, Biolegend, 
116222; APC anti-mouse Ly6G/Ly6C (Gr-1) (as above); Pacific Blue anti- 
mouse Thyl.2, Biolegend 140306; and Brilliant Violet 510 anti-mouse 
CD71, Biolegend 113823), according to specific cell type containing. 
To analyse the double-negative T cells inthe thymus, total thymocytes 
were stained for FITC-anti CD25 (BD Pharmaingen, 1595-02S), and APC 
anti-mouse CD44 (eBioscience, 17-0441-82) together with acollection 
of PE-conjugated antibodiesto remove all other cell lineages (including 
anti-mouse CD4, CD8a (BD Pharmingen, 553032), CD19, TCRy6 (eBiosci- 
ence, 12-5711-82)) and PE/Cy7 anti-mouse TER119. The flow cytometry 
data were collected on either LSRII (BD) with BD FACSDiva software, 
or ona FACSCalibur (BD) with BD CellQuest Pro. All flow cytometry 
data were analyzed using FlowJo V10. 


Cell culture and cell line derivation 

The DNA-PKcs‘ and DNA-PKcs-“4 MEFs were derived from E13.5-E14.5 
embryos obtained from crosses between DNA-PKcs*™ parents using 
standard procedures?**. The DNA-PKcs“ and DNA-PKcs ES cells 
were derived by super-ovulating 3-4-week-old DNA-PKcs‘ female 
mice mated with adult male DNA-PKcs‘ mice. The inner cell masses 
from E3.5 blastocytes were then isolated and expanded for genotyping. 
The newly derived ES cells and controls were cultured on irradiated 
(30 Gy) fibroblast feeders using standard ES cell medium (DMEM, 
Gibco 12430-062; 15% fetal bovine serum, Hyclone, SH30071.03; 
1x MEM non-essential amino acids, Gibco11140-050; 1mM sodium 
pyruvate, Gibco 11360-070, 2 mM L-glutamine, Gibco 25030-081; 120 
LM 2-mercaptoethanol, Fisher 034461-100; penicillin/streptomycin, 
Gibco 15070-63 and leukaemia inhibition factor (LIF, provided by C.-S. 
(Victor) Lin)). The v-ABL kinase-transformed B cell lines were derived 
from EuBCL2* transgenic mice with the indicated genotype as previ- 
ously described”. In brief, single-cell suspension from total bone mar- 
row (from 2-5-week-old mice) or fetal liver was infected with retrovirus 
encoding p120 minimal v-ABL kinase*’. Clonal outgrowth was isolated 
and expanded for the next 6-8 weeks. CRISPR-Cas9-mediated deletion 
of Ku80 was performedas previously described”. 

The Fancd2’ MEFs’ were generously provided by A. Smogorzewska 
(Rockefeller University). The v-ABL kinase-transformed DNA-PKcs" 
B cell lines!”"® were generously provided by B. Sleckman (Cornell Uni- 
versity). 


Measurement of protein synthesis using OPP 
For in vitro analyses of cell lines, 5 x 10° v-ABL kinase-transformed B 
cells were plated in each well of a 24-well plate. ATM or DNA-PK kinase 
inhibitors were added 17 h before the cells were treated with 5 1M OPP 
(ThermoFisher) for 30 min. To prevent protein syntheses (establishing 
anegative control), 100 pg/ml cycloheximide (CHX; Sigma) was added 
at the same time as OPP ina subset of samples. For all the translation 
experiments, the azide-alkyne cycloaddition was performed using the 
OPP Protein Synthesis Assay Kit (ThermoFisher Scientific). In brief, the 
cells were washed with ice-cold PBS, fixed with 3.7% formaldehyde in 
PBS for 15 min at 25 °C, permeabilized with 0.5% Triton X-100/PBS for 
15 min at 25 °C, and then subjected to azide-alkyne cycloaddition and 
detection with azide-conjugated Alexa Fluor 488. The data were col- 
lected using FACSCaliber (BD Biosciences for cell lines) or BD FACSDiva 
(for haematopoietic cells) and analysed using FlowJo V10. 
Translation analyses of haematopoietic cells from mice were con- 
ducted as previously described with minor modifications”. In brief, 
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fresh isolated marrow and splenocytes were cultured for 1 hin RPMI 
1640 complete medium (Gibco 11875) with 15% FBS containing 5 uM OPP 
before staining of live haematopoietic stem cells (see above with minor 
modification of the colour to accommodate FITC-OPP). Specifically, 
Alexa Fluor 700 rat anti-mouse CD34 (eBioscience, 560518) and Ly-6A/E 
(Sca-1) and Pacific Blue anti-mouse Ly-6A/E (Sca-1) (Bioleged, 108120) 
were used for HSPC staining. The relative levels of protein synthesis 
between different haematopoietic populations obtained from transient 
in vitro incubation of bone marrow from 2-week-old DNA-PKcs‘ mice 
(Extended Data Fig. 7d) agreed with prior data obtained with in vitro and 
in vivo injection of OPP in adult mice”. Specifically, among the HSPC 
and lineage specific progenitors, the highest protein translation was 
found in megakaryocytes and erythroid progenitors (MEPs), followed 
by common myeloid progenitors (CMPs). HSPCs (Lin‘SCA1*c-KIT’; 
LSK) have relatively low protein synthesis”’ (Extended Data Fig. 7d). 
To offset the prohibitory cost of in vivo OPP injection, we measured 
global translation with transient (1h) in vitro incubation. 


Northern blot analysis of rRNA processing 

Total RNA was extracted using TRIzol (Life Technology) and about 5 pg 
total RNA was analysed for each sample. Oligo probes measuring 18S 
and 5.8S rRNA processing situated at the 3’ boundary of the mature 18S 
or 5.8S species were designed on the basis of previous publications**”” 
(Supplementary Table 1). 


Immunofluorescence 

U20S cells or mouse ES cells were seeded on gelatin-treated coverslips 
48 h before staining and stained as previously described”. In brief, the 
cells were washed with 1 x PBS, when indicated, incubated with CSK 
buffer? (10 mM Pipes with pH 7.0, 100 mM NaCl, 300 mM sucrose, 
3 mM MgCl, and 0.7% Triton X-100) for 3 min at room temperature 
for pre-extraction, then washed and fixed with 4% paraformaldehyde 
(PFA) in PBS for 25 min at 25 °C. Before staining, the cells were per- 
meabilized with 0.1% Triton X-100/PBS for 10 min and blocked with 
3% BSA for 1h at 25 °C. Fixed cells were then incubated with primary 
antibodies in 3% BSA for Lhat 25 °C, including mouse anti-human KU86 
(ThermoFisher, MA5-12933, 1:100), rabbit anti-human DDX21 (Novus, 
NB100-1718, 1:500) or anti-DNA-PKcs (ThermoFisher, Ab-4(cocktail)), 
followed by fluorophore-conjugated secondary antibodies (Alexa Fluor 
488-conjugated anti-rabbit, Alexa Fluor 594-conjugated anti-rabbit, 
and cyanine3-conjugated anti-mouse, Invitrogen, 1:500) for 1hatroom 
temperature. All images were captured on a Nikon AIRMP confocal 
microscope with a 63x objective. 


Cell cultures 

HeLaandU2OS cells were cultured in DMEM (plus 10% FBS) and main- 
tained under standard tissue-culture conditions unless otherwise 
specified. For actinomycin D (Sigma) treatments, 1 pg/ml was added 
totheculture medium for 1h before harvesting for downstream assays. 
For immunofluorescence studies, cells were fixed with 4% paraform- 
aldehyde, unless otherwise specified, and stained with the indicated 
antibodies. All cell lines used in this study were mycoplasma-free. 


Infrared crosslinking and immunoprecipitation 

irCLIP was performed as described*. HeLa cells were grown as 
described above and UV crosslinked with a total of 0.35 J/cm’. Whole- 
celllysates were generated in CLIP lysis buffer (SO mM HEPES, 200 mM 
NaCl, 1mM EDTA, 10% glycerol, 0.1% NP-40, 0.2% Triton X-100, 0.5% 
N-lauroylsarcosine) and briefly sonicated using a probe-tip Branson 
sonicator to solubilize chromatin. Each experiment was normalized to 
total protein amount, typically 1mg, and partially digested with RNase 
A(ThermoFisher Scientific, ENO531) for 10 min at 37 °C and quenched 
onice. DNA-PKcs (Bethyl, A303-967A), KU80 (Bethyl, A302-627A) and 
IgG (ThermoFisher Scientific, 02-6102) IPs were performed with 15 pg 
antibody and 50 pl Protein G Dynabeads (ThermoFisher Scientific) for 8 


hat 4 °C onrotation. Samples were washed for 1 min at 25 °C eachtime 
in1mlof the following buffers sequentially: 1 x high stringency buffer 
(15 mM Tris-HCl, pH 7.5, 5 mM EDTA, 2.5 mMEGTA, 1% Triton X-100, 1% 
sodium deoxycholate, 120 mM NaCl, 25 mM KCl), 1 x high salt buffer 
(15 mM Tris-HCl pH 7.5, 5 mM EDTA, 2.5 mM EGTA, 1% Triton X-100, 1% 
sodium deoxycholate, 1 M NaCl), 2 x NT2 buffer (50 mM Tris-HCl, pH 
7.5,150 mM NaCl, 1mM MgCl,, 0.05% NP-40). After the NT2 wash, RNA- 
protein complexes were dephosphorylated with T4 PNK (NEB) for 45 
min in an Eppendorf Thermomixer at 37 °C with repeated cycle of 15s 
at 1,400 rpm followed by 90s of rest ina30-l reaction system of pH 6.5 
containing 10 units T4 PNK, 0.1 pl SUPERase-IN (ThermoFisher Scien- 
tific), and 6 pl PEG-400 (16.7% final). Dephosphorylated RNA-protein 
complexes were then rinsed once with NT2 buffer and 3’-end ligated 
with T4 RNA Ligase 1 (NEB) overnight in an Eppendorf Thermomixer 
at 16 °C with a repeated cycle of 15s at 1,400 rpm followed by 90 s of 
rest ina 60-l reaction containing 10 units T4 RNA Ligase, 1.5 pmol pre- 
adenylated-IR800-3’biotin DNA-adaptor, 0.1 p1 SUPERase-IN, and 6 ul 
PEG400 (16.7% final). The following day, samples were rinsed once again 
with 500 pI NT2 buffer and resuspended in 30 pl NT2 buffer containing 
20 mM DTT and1xLDS (ThermoFisher Scientific). Samples were then 
heated to 75 °C for 10 min, and released RNA-protein complexes were 
separated on 4-12% Bis-Tris SDS-PAGE gel (1.0 mm x 12 well) at 200 V 
for 45 min. Resolved RNP complexes were wet-transferred to nitrocel- 
lulose at 550 mA for 45 min at 4 °C. 

Nitrocellulose membranes were imaged using an Odyssey CLx scan- 
ner (LiCor), RBP-RNA complexes were excised using scalpels, and RNA 
was recovered by adding 0.1 ml proteinase K reaction buffer (100 mM 
Tris, pH 7.5, 50 mM NaCl, 1mM EDTA, 0.2% SDS and 5 pl of 20 mg/ml 
proteinase K (ThermoFisher Scientific)). Proteins were digested for 
60 min at 50 °Cin an Eppendorf Thermomixer. Next, 200 pl saturated 
phenol-chloroform pH 6.7 was added to eachtube and incubated for 10 
min at 37 °C in an Eppendorf Thermomixer at 1,400 rpm. Tubes were 
briefly centrifuged, andthe entire contents were transferred toa 2-ml 
Heavy Phase Lock Gel tube (SPrime, 2302830). Samples were centri- 
fuged for 2 min at >13,000 rpm. The aqueous layer was re-extracted with 
1ml chloroform (inverting 10 times to mix; no vortexing) in the same 
2-ml Phase Lock Gel tube and centrifuged for 2 min at >13,000 rpm. 
The aqueous layer was then transferred to anew 2-ml Heavy Phase Lock 
Gel tube and extracted again with an additional 1 ml chloroform. After 
2 min centrifugation at >13,000 rpm, the aqueous layer was transferred 
to a siliconized 1.5-ml tube and precipitated overnight at —20 °C by 
addition of 10 pl 5M NaCl, 3 pl linear polyacrylamide (ThermoFisher 
Scientific) and 0.8 ml 100% ethanol. RNA fragments were pelleted at 
>13,000 rpm for 45 min at 4 °C, washed once with 1 ml ice-cold 75% 
ethanol and air dried. 

RNA pellets were resuspended in 12 pl water by adding 1 pl of 3 uM 
cDNA and 1 pl of 10 mM dNTPs, and heated to 70 °C for 5 min before 
being rapidly cooled to 4 °C. cDNA Master Mix (4 pI 5x Super Script IV 
(SSIV) buffer, 1 11100 mM DTT, 1 pI SSIV, 6 pl total) was added to the 
annealed RNA and incubated for 30 min at 55 °C. cCDNA-RNA hybrids 
werecaptured by addition of 5 pl of MyOne Streptavidin C1 Dynabeads 
(ThermoFisher Scientific) that had been rinsed and suspended in 50 ul 
biotin-IP buffer (100 mM Tris, pH 7.5, 1M NaCl, 1mM EDTA, 0.1% Tween), 
and subjected to end-over-end rotation for 45 min at room temperature. 
Beads were placed on a 96-well magnet and washed twice with 100 pl 
biotin-IP buffer and 100 ul ice-cold 1 x PBS sequentially. Beads were 
resuspended in 10 pI cDNA elution buffer (8.25 pl water adding 1 pl of 
1uM P3 short oligo and 0.75 pl of S0 mM MnCl) and heated to 95 °C 
for 10 min, ramped at 0.1° sto 60 °C until the end of the experiment. 
cDNA was circularized for 2h at 60 °C in 5 pl circularization reaction 
buffer (3.3 pl water, 1.5 1 10x Circligase-II buffer, and 0.5 pl Circligase-Il 
(Epicentre)), and then purified with 30 pl AMPure XP beads (Beckman 
Coulter) and 75 pl isopropanol. Samples were incubated for 20 min at 
25 °C, washed twice with 100 pl 80% ethanol, air dried for 5 min, and 
eluted in14 pl water. Elution took place at 95 °C for 3 min and samples 


were then immediately transferred to a 96-well magnet. Eluted cDNA 
was transferred to anew PCR tube containing 15 pl 2x Phusion HF-PCR 
Master Mix (NEB), 0.5 pl of 30 pM P3/P6 PCRI oligo mix and 0.5 pl of 
15x SYBRGreen | (ThermoFisher Scientific). Real-time quantitative PCR 
was performed as: 98 °C for 2 min, 15 cycles of 98 °C for 15s followedby 
65 °C for 30s, and 72 °C for 30 s, and eventually with data acquisition 
set to the 72 °C extension. PCR1 reactions were purified by adding 4.5 ul 
isopropanol and 54 pl AMPure XP beads before incubation for 10 min. 
Beads were washed once with 80% ethanol, dried for 5 min, and eluted 
in 15 pl water. Illumina flow cell adaptors were added by adding 15 ul 
2x Phusion HF-PCR Master Mix and 0.4 pI P3solexa/Pésolexa oligo mix 
and amplified as: 98 °C for 2 min, 3 cycles of 98 °C for 15s followed by 
65 °C for 30s, and 72 °C for 30s seconds. Final libraries were purified 
by addition of 48 pp! AMPure XP beads and incubation for 5 min. Beads 
were washed twice with 70% ethanol, dried for 5 min, and elutedin 20 pl 
water. Libraries (1-2 p11) were quantified using an HS-DNA Bioanalyzer. 
Samples were deep sequenced on the Illumina NextSeq machine with 
75-bp cycle run of single-end and no index high-output sequencing. 


Analysis of irCLIP data 

The irCLIP data were processed using the FAST-iCLIP pipeline (https:// 
github.com/ChangLab/FAST-iCLIP/tree/lite). PCR duplicates were 
removed using unique molecular identifiers (UMIs) inthe RT primer 
region. Adaptor and barcode sequences were trimmed, and reads were 
mapped step-wise to repetitive and then non-repetitive (GRCh38) 
genomes. Specific parameters used for the FAST-iCLIP pipeline are as 
follows: -f18 (trims 17 nt from the 5’ end of the read), -116 (includes all 
reads longer than 16 nt), -bm 29 (minimum MAPQ score from bowtie2 
of 29 is required for mapping; unique mapping only), and -tr 2,3 (repeti- 
tive genome) and -tn 2,3 (non-repetitive genome) RT stop intersection 
(n,m; where n= replicate number and m= number of unique RT stops 
required per nreplicates). Using the -tr/tn 2,3 parameters, aminimum 
of six RT stops is required to support any single nucleotide identified 
as crosslinking site. For gene ontology analysis, the DAVID® tool was 
employed (https://david.ncifcrf.gov/tools.jsp). Peaks of RT stops were 
called on the biological replicated intersection of RT stop positions 
using iCount peaks (http://icount.readthedocs.io/en/latest/). The 
command line was as follows: iCount peaks gencode.v21.annotation. 
segment.gtfRTstop_input.bed Out_iCpeaks.bed-scores Out_iCpeaksS- 
cores.tsv. Regions from ‘Out_iCpeaks.bed’ were then annotated using 
HOMER (http://homer.ucsd.edu/homer/) using the following com- 
mand: annotatePeaks.pl Out_iCpeaks.bed hg38> Out_iCpeaks_hg38_ 
HOMERanno.txt -annStats Out_iCpeaks hg38 HOMERanno stats.txt. 


Comprehensive identification of RNA binding proteins by mass 
spectrometry 

U3 snoRNA targeting probes were designed using online tools avail- 
able at https://www.biosearchtech.com/stellaris, with a repeat mask- 
ing setting of 3 and even coverage of the whole transcript. Full probe 
sequencesare available in Supplementary Table 2. Oligos were synthe- 
sized with 3’ biotin-TEG modification at Stanford Protein and Nucleic 
Acid Facility (panoligo@stanford.edu). 

ChIRP-MS was performed largely as described. Five 15-cm dishes of 
cells (~-200 million cells) were used per ChIRP-MS experiment. Cell cul- 
ture and ActD treatments were performed as described above. Medium 
was aspirated and cells were rinsed once with 5 ml of 1 x PBS. PBS was 
then aspirated, and 10 ml of 3% formaldehyde in 1x PBS was added to 
each dish. Cells were scraped off the 15-cm dishes during chemical 
crosslinking, which took place for a total of 30 min at 25 °C and was 
stopped by the addition of glycine to a final concentration of 125 mM 
for 5 min at 25 °C. Crosslinked cells were pelleted at 2,000 rpm for 
5min, supernatant was discarded, and cells frozen at —80 °C for storage. 
Lysate was generated by resuspending cell pellets in 1 ml lysis buffer 
(50 mM Tris-HCI pH 7.0, 10 mM EDTA, 1% SDS) per 100 mg of cell pellet 
weight (-100 pl pellet volume). Lysates were sonicated using a focused 


ultrasonicator (Covaris, E220) until the average RNA length was 500 
nucleotides as determined by agarose gel analysis and stored at —80 °C. 
Stored lysates were thawed on ice and prepared for pre-clearing and 
RNase A treatment (control sample). The experimental samples were 
precleared by adding 30 pl washed MyOne C1 beads per ml of lysate 
at 37 °C for 30 min on rotation. To control sample preclearing, RNA 
digestion was achieved by adding 30 pl washed MyOne C1 beads per 
ml lysate and 1/1,000 the volume of 10 mg/ml RNase A (Sigma) and 
incubating at 37 °C for 30 min on rotation. Preclearing beads were 
collected twice from lysate using a magnetic stand; for this and all 
subsequent magnetic stand steps we allowed >1 min of separation 
before removing any supernatant. Next, 2 ml of ChIRP hybridization 
buffer (750 mM NaCl, 1% SDS, 50 mM Tris-HCI pH 7.0, 1 mM EDTA, 15% 
formamide; made fresh) and 2.5 pl of 10O pM ChIRP Probe Pools were 
added per ml of lysate. ChIRP Probe Pools (Supplementary Table 1) 
werecomprised of seven antisense oligos (final concentration of 16 uM 
of each probe). Hybridization took place on rotation for 16 h at 37 °C. 
Subsequently, 250 pl ofwashed MyOne C1 beads per ml of lysate were 
added to each sample and incubated on rotation for 45 min at 37 °C. 
Enriched material was collected on the beads with a magnetic stand, 
and beads were washed 5 x 2 min in 1 ml ChIRP Wash Buffer (2 x NaCl- 
sodium citrate (SSC, ThermoFisher Scientific), 0.5% SDS) at 37 °C. To 
elute enriched proteins, beads were collected on a magnetic stand, 
resuspended in ChIRP biotin elution buffer (12.5 mM biotin, 7.5 mM 
HEPES, pH 7.9, 75 mM NaCl, 1.5 mM EDTA, 0.15% SDS, 0.075% sarkosyl, 
and 0.02% Na-deoxycholate), and mixed at 25 °C for 20 min on rotation 
and at 65 °C for 15 min shaking. Eluent was transferred toa fresh tube, 
and beads were eluted again. The two eluents were pooled (~1,200 pl), 
and residual beads were removed again using the magnetic stand. 
Twenty-five per cent total volume (300 pl) of trichloroacetic acid was 
added totheclean eluent and vortexed, andthensamples were placed 
at 4 °C overnight for precipitation. The next day, proteins were pelleted 
at 21,000 relative centrifugal force (rcf) at 4 °C for 45 min. Supernatant 
was carefully removed, and protein pellets were washed once with 
ice-cold acetone. Samples were spun at 21,000 rcf at 4 °C for 5 min. 
Acetone supernatant was removed and tubes were briefly centrifuged 
again and, after removal of residual acetone, were left to air-dry ona 
bench-top. Proteins were then solubilized in1x LDS buffer in NT2 with 
20 mM DTT and boiled at 95 °C for 30 min with occasional mixing for 
reverse-crosslinking. 

Protein samples were size-separated on bis-tris SDS-PAGE gels (Bio- 
Rad), and the gel was fixed and stained with the Colloidal Blue Staining 
Kit (ThermoFisher Scientific) as per the manufacturer’s instructions. 
Each ChIRP-MS experiment was cut into seven slices from the SDS- 
PAGE and prepared independently. Gel slices were prepared for mass 
spectrometry by rinsing sequentially in 200 pl HPLC-grade water, 100% 
acetonitrile (ACN, ThermoFisher Scientific) and 50 mM ammonium 
bicarbonate (AmBic). Samples were reduced by adding 200 ul of 5 mM 
DTTin50 mM AmBic and incubating at 65 °C for 35 min. The reduction 
buffer was discarded, andsamples were cooled toroom temperature. 
Alkylation was achieved by adding 200 ul of 25 mM iodoacetaminde 
in50 mM AmBic for 20 min at 25 °C in the dark. The alkylation buffer 
was discarded, samples were rinsed once in 200 p50 mM AmBic, and 
then they were washed twice for 10 min each in 200 ul of freshly pre- 
pared 50% ACN in 50 mM AmBic. After each wash, the supernatant 
was discarded, and after all washes, samples were dried for 3 h using 
a SpeedVac. Once dry, the proteins were digested by adding 100 ng 
trypsinin 200 pl of 50 mM AmBic for 16 hat 37 °C. Samples were sub- 
sequently acidified by adding formic acid to a final concentration of 
2.5% and incubating at 37 °C for 45 min. Finally, samples were desalted 
using HyperSep Filter Plates with a 5—7-pl bed volume (ThermoFisher 
Scientific) following the manufacturer’s instructions. Samples were 
eluted three times in 100 pl 80% ACN in 2.5% formic acid, dried ona 
SpeedVac, and resuspended in 10 pl 0.1% formic acid for mass spec- 
trometry analysis. Desalted peptides were analysed by online capillary 
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nanoliquid chromatography with tandem MS (nanoLC-MS/MS). Sam- 
ples were separated using a 20-cm reversed phase column fabricated 
in-house (100 pm inner diameter, packed with ReproSil-Pur C18-AQ 
3.0 um resin (Dr. Maisch GmbH)) that was equipped witha laser-pulled 
nanoelectrospray emitter tip. Peptides were eluted at a flow rate of 400 
nl/min using a two-step linear gradient of 2-25% buffer Bin 70 min and 
25-40% B in20 min (buffer A: 0.2% formic acid and 5% DMSO in water; 
buffer B: 0.2% formic acid and 5% DMSO inacetonitrile) in an Eksigent 
ekspert nanoLC-425 system (AB Sciex). Peptides were ionized with 
electrospray ionization into an Orbitrap Elite Hybridlon Trap-Orbitrap 
Mass Spectrometer (ThermoFisher Scientific). Instrument method 
parameters were as follows: MS1 resolution, 60,000 at 400 m/z; scan 
range, 340-1,600 m/z. The top 20 most abundant ions were subjected 
tocollision-induced dissociation with a normalized collision energy of 
35%, activation q 0.25, and precursor isolation width 2 m/z. Dynamic 
exclusion was enabled with a repeat count of 1,arepeat duration of 30s, 
and an exclusion duration of 20s. 

FASTA sequences of the human proteome (Uniprot: UPO00005640) 
were downloaded and used to search the data files (.raw) using Byonic 
(Protein Metrics) with the following parameters: semi-specific cleav- 
age specificity at the C-terminal site of R and K allowing for 2 missed 
cleavages. Mass tolerance was set at 12 ppm for MSIs, 0.4 for MS2s. 
Methionine oxidation, asparagine deamidation, and N-term acetylation 
were set as variable modifications. Cysteine carbaminomethylation was 
set asa fixed modification. Peptide hits were filtered using a 1% FDR. 
To visualize ChIRP-MS protein hits, paired Uniprot ID and ChIRP-MS 
enrichment values were imported into Cytoscape using the GeneMA- 
NIA extension™. Connections between hits visualized were selected to 
display only direct protein-protein interactions. Each protein hit was 
coloured by the enrichment value (log,ChIRP/log,RNase). 


ChIRP-qRT-PCR 

Cells were grown, crosslinked, and sonicated as described above. After 
sonication, 1% of the lysate was removed and saved as an ‘input’ sam- 
ple. Lysates were again processed as above for preclearing, hybridiza- 
tion, MyOneC1 capture, and bead washing. After washing, 1% of each 
sample was removedas an ‘enriched’ fraction. Enriched fractions were 
collected while the MyOneCl beads were fully resuspended in ChIRP 
wash buffer. The input and enriched samples were brought to 95 wlin 
ChIRP PK buffer (10 mM Tris-HCI pH 7.0, 100 mM NaCl, 1 mM EDTA, 
0.2% SDS) and to this was added 5 pl of 20 mg/ml proteinase K. Protein 
was digested while shaking at 55 °C for 45 min. RNA was extracted by 
adding 500 ul TRIzol (ThermoFisher Scientific), incubating at 55 °C 
for 5 min, and then adding 100 ul chloroform. After mixing samples 
by vortexing for 7 s each, samples were incubated at 25 °C for 5 min 
and then spun at 12,000 rpm at 4 °C for 15 min. The aqueous layer 
was carefully removed from each sample, mixed with two volumes 
of 100% ethanol, and purified using an RNA Clean & Concentrator-25 
(Zymo Research) per the manufacturer’s instructions. All RNA sam- 
ples were DNase-treated with the Turbo DNA-Free kit (ThermoFisher 
Scientific). The cDNA was generated using SuperScript VILO (Ther- 
moFisher Scientific) according to the manufacturer’s instructions. 
The qPCR analyses were performed onthe CFX96 Touch Real-Time 
PCR Detection System (Bio-Rad). All primers used are shown in 
Supplementary Table 2. 


Secondary structure folding of U3 snoRNA 

The human U3 snoRNA sequence was obtained (NR_006880.1) and 
secondary structure folded using the mFold web server™ using default 
settings. A Vienna file (.b) was exported for the folded structure and 
visualized in VARNA*. 


Electrophoretic mobility shift assay 
The stem-loop 1 region of U3 was synthesized with a T7 polymerase pro- 
moter and subsequently in vitro transcribed (IVT) using the MEGAscript 


T7 Transcription Kit (Thermo Fischer Scientific) per the manufacturer's 
protocol to produce IVT-U3-SL1. To fluorescently label IVT-U3-SL1, 
periodate labelling of RNA 3’ends was performed using Cy7-amine 
(Lumiprobe) as the dye. To perform the electrophoretic mobility shift 
assay (EMSA) reaction, IVT RNA was first denatured at 75 °C for 5 min, 
cooled rapidly on ice for 3 min, and then added to 1x EMSA buffer 
(10 mMHEPES pH7.5, 20 mM KCI, 1mM MgCl,,and1mM DTT) for5 min 
at 37 °Cto allow the RNA to refold. For reactions that contained non- 
labelled competitor RNA or DNA, these were added atthe refolding step. 
Non-labelledIVT-U3-SL1 and PCR DNA of U3-SL1 was added at 1:1or 5:1 
as unlabelled:labelled nucleic acids. For the DNA competitor, each end 
of the dsDNA was considered independently inthe molar calculations. 
Finally, to each reaction we added 1 pl of 50% glycerol and recombinant 
DNA-PK holoenzyme (Promega), and water to 10 pl. EMSA reactions 
were incubated at 25 °C for 30 minand then run directly ina 4% native 
PAGE at 100 V. After electrophoresis, gels were directly scanned ona 
LiCor Odyssey in the 800 channel. 


Invitro DNA-PK kinase reactions 

IVT RNA, dsDNA, and DNA-PK holoenzymewere prepared as described 
above for EMSA analysis. Kinase reactions were assembled with 1 pl 
recombinant DNA-PK holoenzyme (Promega) and titrated amounts 
of nucleic acids (RNA, DNA, or none). To test for the dependence on 
ATP hydrolysis, 200 tM ATP, no ATP, or 200 uM adenosine-5’-[(B,y)- 
methyleno]triphosphate (AppCp) was added. When indicated, the 
DNA-PKcs inhibitor NU7441 was added to a final concentration of 
1pM. All reactions occurred for 60 min at 25 °C and were subsequently 
assayed by western blotting. The baculovirus-purified human DNA-PK 
(Thermo PV5866), TP53 (Millipore 23-034) and active human ATM 
(Millipore 14-933) were used in the assays. The kinase reactions were 
performed in 1x kinase buffer (Thermo PV3189) with 100 pM ATP and 
1.5 plrecombinant DNA-PK holoenzyme or 1.0 pl ATM inthe presence of 
TP53. Allreactions occurred for 60 min at 25 °C and were subsequently 
assayed by western blotting. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


irCLIP data are available via the Gene Expression Omnibus (GEO) under 
the accession number GSE109026. All uncropped blots are provided in 
Supplementary Fig. 1. Data underlying the graphical representations 
used in the figures, including all experiments presenting data from 
animal models, are provided as Source Data. Exact P values and defined 
sample sizes (n) are provided in Supplementary Data 1. 


Code availability 


The FAST-iCLIP software is freely available at https://github.com/ 
ChangLab/FAST-iCLIP/tree/lite. 
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Extended Data Fig. 1| Myeloid proliferative disease with MDS features in 
DNA-PKcs®"°Tp53” mice. a, Representative fluorescence-activated cell 
sorting (FACS) analyses of pro-B cell lymphomas from DNA-PKcs" Tp53~ 
mice. b, The frequency of megakaryocytes with hyperchromatic or pyknotic 
nucleiand total megakaryocytes per 400 fieldinthe spleen. Eight fields from 
three independent DNA-PKcs*°Tp53” mice were quantified and are 
presented here. Mean +s.e.m.; two-sided unpaired student’s t-test, **P< 0.01, 
*“*P<0.001.c, Representative FACS analyses of myeloid proliferative disease 
in DNA-PKcs®Tp53~ mice. The CD19-negative bone marrowfrom 
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type bone marrow, as DNA-PKcs” Tp53” mice have no pre-B and matureB 
cells (CD19*) owing to cNHE)J defects. Thereis also an increase in the percentage 
of c-KIT' cellsamong the myeloid cells (CD11b*GRI‘) from DNA-PKcs" Tp53” 
mice. Mid-sized reticulocytes and RBC progenitors are reduced in the 
DNA-PKces*”"“°Tp537 bone marrow. Exact Pvalues and defined sample sizes (n) 
are providedin Supplementary Datal. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2| Generation and characterization of the 
DNA-PKcs?" and DNA-PKcs“ mouse models. a, Diagram of the protein 
sequence changes in the DNA-PKcs’ allele’®. b,c, Diagram of the protein 
sequence changes in the DNA-PKcs™ allele and the targeting scheme. The TQ 
sites are shown in upper case for ease of identification. Asterisks denote sites 
mutated in the previously published DNA-PKcs* allele’. The schematic 
diagram (not to scale) represents the mouse Prkdc (DNA-PKcs) locus (top), 
targeting vector (second row), targeted allele (DNA-PKcs*", third row), and the 
neo-deleted mutant allele (DNA-PKcs™, bottom). The red exoncontains the 
mutations. Boxes, exons; triangles, FRT sites. d, Southern blot analyses of 


Spel-digested DNA from DNA-PKcs and DNA-PKcs" (with NeoR) ES cells. 

e, f, The concentration of RBCs and platelets (PLT) in the peripheral blood of 
5-month-old DNA-PKcs??? mice and DNA-PKcs“ controls. g, The platelet 
concentration in the peripheral blood of 2-week-old DNA-PKcs”*, DNA-PKcs“ 
and DNA-PKcs“ mice.h, i, Representative images show pale kidney (h) and liver 
(i) from 2-week-old DNA-PKcs mice, indicating extreme anaemia. 

e-g, Mean +s.e.m.; two-sided unpaired Student’s t-test, ***P< 0.001; n.s., not 
significant, P> 0.05). Exact Pvalues and defined sample sizes (n) are provided 
in Supplementary Datal1. 
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Extended Data Fig. 3 | Haematopoiesis and translation defects in 
DNA-PKcs mice. a, b, Flow cytometry analyses (a) and quantification (b) 
of 2-week bone marrow progenitor cell populations. The same gating strategy 
was used throughout to define progenitor populations. Lin SCAI‘c-KIT* 
haematopoietic stem and progenitor cells (LSK, HSPC) and LK cells 

(Lin°SCAT c-KIT’) were further divided into CMPs, granulocyte-monocyte 
progenitors (GMPs) and MEPs”’.c, d, Flow cytometry analyses (c) and 


Frequency in fetal liver (%) 


quantification (d) of E14.5 fetal liver HSPC (Lin SCAI‘c-KIT’) frequency from 
DNA-PKcs" and DNA-PKcs“ embryos. e, Flow cytometry gating strategy for 
analysing proteintranslation in Lin’ cells. The gating strategy for progenitors is 
shown ina. This strategy (a, e) was used to determine global translation for 
each population in Extended Data Fig. 7d. Mean +s.e.m.; two-sided unpaired 
Student’s t-test; ***P< 0.001, *P<0.05,n.s. P>0.05. Exact Pvalues and defined 
sample sizes (n) are provided in Supplementary Datal. 
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Extended Data Fig. 4| See next page for caption. 


Extended Data Fig. 4 | Normal V(D)J recombinationin DNA-PKcs“ mice and 
cells. a, Flowcytometry analyses of lymphocyte development in DNA- 
PKces*“““Tp537 and control mice. b, Frequency of B220*IgM‘ naive mature 

B cells from spleen. c, Quantification of peripheral RBCs in DNA-PKcs mice 
withand without TP53 deficiency. Note that wild-type, DNA-PKcs“ and 
DNA-PKcs“ data from Fig. 2b are included here for comparison. d,e, Absolute 
counts of peripheral lymphocytes (d) and neutrophils (e) in DNA-PKcs“ mice 
with or without 7p53 or Ku70 deficiency. TP53-deficiency rescued both the 
lymphocytopaenia and the neutropaenia in DNA-PKcs mice. Notably, 

KU70 deficiency caused severe lymphocytopaenia by itself or with the 
DNA-PKcs mutation, but rescued the neutropaenia in DNA-PKcs™™ mice. 
f-h, Chromosomal V(D)J recombination measured in v-ABL kinase-transformed 
B cells. f, Diagram of pMX-INV chromosomal V(D)J recombination substrate”. 


Empty and filled triangles represent recombination signal sequences. 

RV, EcoRV, N, Ncol. Dashed line marks the C4 probe used for Southern blotting. 
g, Southern blotting reveals the accumulation of normal coding joins (CJ) and 
signal joins (SJ) and suggests successful V(D)J recombination in DNA-PKcs” 
Bcells.h, Successful V(D)J recombination places Gfpinthe same orientation as 
the promoter and leads to GFP expression’. Plots show the frequency of GFP* 
cells after 4 days of STI571 (3 pM) treatment. i, Quantification of DN2and DN3 
cells out ofall double-negative cells (left, reflecting TCRB V(D)J recombination) 
and percentage of CD4* or CD8* single-positive (SP) cells (right, reflecting 
TCRa V(D)J recombination). b-e, i, Mean+s.e.m.; two-sided unpaired Student’s 
t-test.***P< 0.001, **P<0.01,*P<0.05,n.s. P> 0.05. Exact Pvalues and defined 
sample sizes (n) are provided in Supplementary Datal. 
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Extended Data Fig. 5 | DNA-PKcs MEFs and ES cells are not hypersensitive 
toDNA-damaging agents. a, b, DNA-PKcs“ MEFs are not hypersensitive to 
ionizing radiation (IR) or mitomycin C (MMC). Fancd2” cells" are included as 
positive controls.c-e, Representative colony formation assays (c) and 
quantification of sensitivity to MMC (d) and IR (e) in DNA-PKcs and control 
ES cells. Mean +s.e.m. ofa representative experimentin triplicate. Two-sided 


6 Area Pl 


unpaired Student’s t-test; P> 0.05 (n.s). f, Normal proliferation curve of DNA- 
PKcs*4*4 MEFs. g, DNA-PKcs*“*4 MEFs have normal S phase frequency. The 
accumulation of aneuploid cells is unique to 7p53” cells. a, d-f, Mean+ts.e.m. 
Exact Pvalues and defined sample sizes (n) are provided in Supplementary 
Datal. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | DNA-PKcs ES cells have KU-dependent 
proliferation defects that cannot be fully explained by cNHE) or telomere 
defects. a, Representative colony formation assay for DNA-PKcs“ ES cells. 
Note the frequent accumulation of small colonies in DNA-PKcs™™ cells, which is 
rescued by deletion of KU80. b, Quantification of colony size (A.U., arbitrary 
units) from DNA-PKcs and control ES cells. Two independently derived ES 
cells were assayed. c, Cell cycle analyses of two independently derived DNA- 
PKcs* and control ES cell lines. The percentage of BrdU’ cells is shown. 

d, Frequency of metaphase with telomere abnormalities (see below for 
definition). e, Representative telomere fluorescence in situ hybridization 
(FISH) images of anormal mouse chromosome (top, with four telomere dots), a 
chromosomewitha chromatid break (middle, showing loss of one telomere 
signal among the two sister chromatids), ora chromatid fusion without 
telomere signal (bottom). f, Quantitative analyses of telomereinstability and 
chromosomal breaks in metaphase. Telomere FISH analyses of MEFs were 
performed with thetelomere-specific PNA probeas previously described’. 
Normal mouse chromosomes have four discrete telomere signals (e, top). 


Telomere instability or breaks considered include: i) telomere instability 
(indicated by more than one telomere signal per chromatid), ii) telomere/ 
chromosome fusion (e, bottom; with telomere at the fusion junction (telomere 
fusion) or without telomere signal at the fusion junction (non-telomere 
fusion)), iii) chromosome breaks (S.B.; loss of both telomere signals on the 
paired sister chromatids) and iv) chromatid breaks (T.B.; loss of one of thetwo 
chromatids (e, middle)). The number of metaphases with at least one telomere 
instability, break or fusion is shown in das a percentage of metaphases with 
abnormalities. Data derived from four independent MEF lines of each 
genotype. g, Representative flow cytometry analyses of erythroblasts from 
age-matched (2 weeks) DNA-PKcs" and DNA-PKcs” mice. h, Representative 
protein translation analyses of S1(CD7I'Terl119_), S2 (CD71‘Ter119™*) and S3 
erythroblasts (CD7I’Ter119"*") from 2-week-old DNA-PKcs**, DNA-PKcs’ and 
Ku707% mice. Quantification is shown in Fig. 2i.b,d, Mean +s.e.m.; two-sided 
unpaired Student’st t-test, **P< 0.01, *P<0.05,n.s. P> 0.05. Exact Pvalues and 
defined sample sizes (n) are provided in Supplementary Datal. 
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Extended Data Fig. 7 | Global translation defects in DNA-PKcs“ 
haematopoietic cells. a, MCV of RBCs from 2-week-old DNA-PKcs“ mice with 
or without TP53 deficiency. Wild-type and 5A/SA data from Fig. 2e included for 
comparison. b, Relative percentage of S1, S2 and S3 cells among all 
erythroblasts in fetal liver from DNA-PKcs“ or DNA-PKcs“ E14.5 embryos. S4 
and S5 populations have not yet evolved significantly in the E14.5 fetal liver. 

c, The relative frequencies of S5, the most mature erythroblast, in P14 bone 
marrow from DNA-PKcs“ or DNA-PKcs“ mice. d, Relative OP-puro levels in 
P14 wild-type bone marrow haematopoietic cells. The average of total bone 
marrow was set tol. The erythroblasts (S1,S2 and $3"*") clearly have the highest 
OP-Puro levels. As shown in Fig. 2h, S3 erythroblasts have an OP-puro"®" and an 
OP-puro’ population. The mean OP-puro levels of both populations are 
included. e, Representative global protein translation in S1, S2 and S3 


erythroblasts from E14.5 DNA-PKcs” and DNA-PKcs“ fetal liver. f, Frequency 
of OP-Puro’’ among SI, S2, and S3 erythroblasts in E14.5 fetal liver. OP-Puro 
labelling of fetal liver cells was performed for 30 min. g, Quantification of the 
frequency of OP-Puro™ among Sland S2 erythroblasts from 2-week-old DNA- 
PKcs*“““Tp53*" mice. h, Relative OP-puro levels (normalized to the levels in 
untreated DNA-PKcs‘ B cells) of DNA-PKcs” and DNA-PKcs*“ B cells. 
Unpaired Student’s t-test, ***P< 0.001, *P< 0.05. i, Relative OP-puro levels of 
DNA-PKcs‘ B cells treated with ATM kinase inhibitor (KU55933, 15 1M for 17h). 
Data represent three independent biological experiments (P= 0.38 for 
unpaired Student’s ¢-test). a-d, f-i, Two-sided unpaired Student’s t-test, 

***P< 0.001, **P< 0.01, *P<0.05,n.s. P> 0.05. All graphs show mean +s.e.m. 
Exact Pvalues and defined sample sizes (n) are provided in Supplementary 
Datal. 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | KU associates with SSU processome components. 

a, KU86 IP-MS and U3 ChIRP-MS overlap. Zhou et al.*? (n=2 independent MS 
assays; for each assay, n=2 technical replicates) identified 292 proteins 
enriched with KU86 protein. U3 ChIRP-MS identified 483 proteins enriched 
more than twofold with the U3 snoRNA (Supplementary Table 1). These 
identified factors were intersected, resulting in 153 proteins in common 
between the two affinity purification strategies, whichis highly significant 
(hypergeometric P< 6.650 x 10°). To understand what types of protein were 
enriched only with KU86, only with U3, or together with both factors, we 
isolated the enrichment values (—log,).(Benjamini)) for GO biological process 
terms for each of these sets and compared them. Factors commonly bound 
were biased for rRNA processing, ribosomal terms and SSU biogenesis. U3- 
specific factors had additional enrichment in these categories and KU86 hada 
set of unique terms that were not well represented in U3. b, Independent repeat 
of northern blot analyses of 18S rRNA maturation in v-ABL kinase-transformed 
Bcells from noted genotypes. The probe covers the sequence just after the 18S 


rRNA (red line). This experiment was repeated independently four times. 
Another repeat is shown in Fig. 3c. c,d, MS characterization of commercial 
DNA-PK holoenzyme (Promega) used in EMSA and kinase reactions. A detailed 
description of proteins and their quality in this mixture has not been published. 
We subjected the DNA-PK enzyme mix as provided to SDS-PAGE separation 
followed by LC-MS identification of proteins from mass ranges between 65 and 
600 kDa. KU70 and KU86 were clearly present inthe gel and via MS. For masses 
above 130 kDa, DNA-PKcswas the major protein identified. For each of the five 
slices analysed (coloured regions) we tabulated the starting positions of 
peptides matching the DNA-PKcs polypeptide and mapped them to the 
position within DNA-PKcs. As expected, inthe highest-molecular-weight slices, 
we identified peptides across the majority of the length of DNA-PKcs. DNA- 
PKcs peptides were present in lower slices, but poorer overall coverage was 
evident, suggesting that these are degradation products. Asthis wasa 
confirmatory experiment ofa validated and commercially available product 
(see Methods), it was conducted only once. 
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Extended Data Fig. 10 | DNA-PKcs and KU86 bind nuclear and nucleolar 
RNAsin living cells. a, b, irCLIP of DNA-PKcs (a) and KU86 (b) in HeLacells. RNA 
(IR8OO) and protein blots show specific enrichment of RNA UV-crosslinked to 
DNA-PKcs and KU86. The increased signal in the RNaseA-treated samples 
corresponds to the accumulation of RNaseA-protected fragments that are 
directly associated with either KU86 or DNA-PKcs. Results shown are 
representative of three biologically independent experiments. Vertical line 
marks the RNA extracted for sequencing. c,e, Fractional distribution of DNA- 
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Extended Data Fig. 11| DNA-PK interacts with structured RNAs that can 
activate its kinase. a, b, Correlation analysis of total RT stops mapping to non- 
repeat snoRNAtranscripts from DDX21** compared to DNA-PKcs (a) or KU86 
(b) irCLIP experiments from DMSO-treated HeLa cells. Correlation analysis was 
performed using Pearson’s correlation coefficient. n denotes number of 
snoRNA transcripts bound by each protein. c, EMSA of purified human DNA-PK 
and in vitro transcribed U3-SL1. Lane 1 contains only Cy7-labelled U3-SL1. Lanes 
2-4 show that KU assembles with U3-SL1 at a 1:3 molar ratio, while DNA-PK 
holoenzyme assembly occurs at a1:25 molar ratio. Lanes 5 and 6 show that 
unlabelled U3-SLIRNA competes away bound labelled U3-SL1in a dose- 
dependent manner. d, Supershift EMSA of DNA-PK and U3-SLIRNA with KU86 
antibody. The addition of anti-KU86 confirms the identity of the KU-U3-SL1 
band and also shifts up the complex to higher molecular weights. e, A structural 
mutant of U3-SL1 was generated by introducing point mutations predicted to 
disrupt the stem-loop structure. This mutant was unable to compete away wild- 
type U3-SL1 for binding to the KU complex, while unlabelled wild-type U3-SL1 
competedefficiently. f, DNA-PK in vitro kinase phosphorylation assay inthe 
presence of increasing amounts of U3-SL1 or DNA. Western blot was performed 


with an antibody recognizing DNA-PKcs phosphorylated at the T2609 cluster. 
Asterisks denote cross-reactive fragments that probably include 
phosphorylated DNA-PKcs fragments, on the basis of MS analyses of the DNA- 
PK complex (Extended Data Fig. 9c, d).g, Asinf, but using an antibody 
recognizing DNA-PKcs phosphorylated at the $2056 cluster. h-j, Asinfwith the 
following changes. h, dsDNA was used to activate DNA-PK, NU7441 was 
included to inhibit specific DNA-PK activity, and western blot analysis 
monitored the total DNA-PK (total DNA-PKcs) or phosphorylated DNA-PK 
(DNA-PKcs phoT2609). i, U3-SLIRNA was used to activate DNA-PK in the 
absence or presence of the DNA-PK inhibitor NU7441. j, U3-SL1 RNA was used to 
activate DNA-PK, hydrolysable (ATP) or non-hydrolysable (AppCp) ATP was 
provided, and western blot analysis monitored KU86 (loading control) or 
phosphorylated DNA-PK (phoT2609).k, Baculovirus-purified human DNA-PK 
in vitro kinase phosphorylation assay inthe presence of increasing amounts of 
U3-SL1or DNA. Western blot was performed with antibodies recognizing DNA- 
PKcs phosphorylated at the S2056 cluster (top), total DNA-PK (middle) and 
KU86 (bottom). All EMSA and western blots presented here are representative 
of three biologically independent experiments. 
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at least 3 independently derived samples were measured. For example, deep sequencing experiments were performed in duplicate, each 
mice genotype data were collected on more than 3 individuals from more than 2 independent crosses. 


Data exclusions No data were excluded from the analysis. 

Replication As no data were excluded, all values from each experiment are presented and the variation can be seen in the figures. Statistical analysis as 
described throughout the manuscript provided the ability to confidently assess differences between different experimental conditions. More 
information and details can be found on Supplemental Data 1. All mouse phenotype analyses were done multiple times independently with 


more than 3 mice from at least 2 independent crosses. Only results that can be successfully repeated were included in the manuscript. 


Randomization Randomization was not performed, however biological replicate experiments were performed on different days. For the animal and mouse 
cell line based experiments, the samples are allocated based on on genotypes (confirmed and validated via PCR). 


Blinding No blinding was used during the experimentation. The mutant mice were visibly smaller, genotypes have to be determined ahead of the 
experiments to plan complicated analyses. 


Reporting for specific materials, systems and methods 


aterials & experimental systems ethods 
n/a | Involved in the study n/a | Involved in the study 
Unique biological materials ChIP-seq 
Antibodies Flow cytometry 
Eukaryotic cell lines MRI-based neuroimaging 


Palaeontology 


Animals and other organisms 


Human research participants 


Antibodies 


Antibodies used [Human Antibodies for Westerns (dilution 1:2000): pDNA-PKcs $2056 (Abcam, ab18192, lot. GR296008-1) and T2609 clone 10B1 
(ab18356, lot. GR309667-22), DNA-PKcs (Novus Biologicals, NBP2-33995, lot. R32563), DNA-PKcs (Bethyl, A303-967A), Ku80 
(Bethyl, A302-627A and Novus, NBP2-373583, lot. 140928) and p53 clone 1C12 (Cell Signaling, 2524S, lot. 13)]. Rabbit IgG 
(ThermoFisher Scientific, 02-6102, lot. Dilution 1:10,000 ). Secondary antibodies include Alexa-Fluor 488, 568, and 647 (Life 
Technologies). Antibodies for FACS analysis: biotin anti-mouse CD4 (cat. 100404, clone GK1.5, lot. B245033), biotin anti-mouse 


CD8a(cat. 100704, clone 5.3-6.7, lot. B257790), biotin anti-mouse CD3e(cat. 100304, clone 145-2C11, lot. B268248), biotin anti- 
mouse B220(cat. 103204, clone RA3-6B2, lot. B268525), biotin anti-mouse Ly60G/Ly6C (Gr-1) (cat. 108404, clone RB6-8CS5, lot. 
B236916), biotin anti-mouse TER119(cat. 116204, clone TER-119, lot. B256623), FITC anti-mouse CD8a (cat. 100706, clone 
53-6.7, lot. B259672), PE/Cy7 anti-mouse CD16/32(cat. 101318, clone 93, lot. B249751), PE/Cy7 anti-mouse TER119(cat. 116222, 
clone TER-119, lot. B251241), APC anti-mouse Ly6G/Ly6C(Gr-1) (cat. 108412, clone RB6-8C5 ), APC anti-mouse TER119(cat. 
116212, clone TER-119, lot. B246740), APC anti-mouse CD117(c-Kit) (cat. 135108, clone ACK2, lot. B255217), APC/Cy7 
Streptavidin (cat. 405208, lot. B258702), Brilliant Violet 510 anti-mouse CD41 (cat. 133923. clone MWReg30, lot. B258166), 
Brilliant Violet 510 anti-mouse CD71 (cat. 113823, clone RI7217, lot. B301604) and Pacific Blue anti-mouse Ly-6A/E(Sca-1) (cat. 
108120, clone D7, lot. B258720) were purchased from BioLegend. APC anti-mouse CD44(cat. 17-0441-82, clone IM7, lot. 
4312474), and PE-Cyanines5 anti-Hu/Mo CD45R(B220)(cat. 15-0452-83, clone RA3-6B2, lot. 2052744) were purchased from 
eBioScience. FITC rat anti-mouse CD25(cat. 1595-028, lot. 17103-P624AF), Goat anti-mouse IgM (cat. 1020-09, lot. K2915-5357) 
were purchased from Southern Biotech). PE rat anti-mouse CD4(cat. 557308, clone GK1.5, lot. 8093955), PE rat anti-mouse 
CD19 (cat. 557399, clone 1D3, lot. 7213714) and biotin rat anti-mouse CD5(cat. 553018, clone 53-7.3, lot 8115630), FITC anti- 
mouse CD43(cat. 553270, clone S7, lot. 6110908), and Alexa Fluor 700 rat anti-mouse CD34(cat. 560518, clone RAM34, lot. 
8116836) were purchased from BD Pharmingen. All FACS antibodies were used at 1:200 dilution. Antibodies used for 
Immunofluorescence: Mouse anti-Hu/Mo Ku80(111) monoclonal antibody (MA5-12933) was purchased from ThermoFisher and 
used at 1:100 dilution. Rabbit anti DDX21 polychonal antibody (NB100-1718) was purchased from Novus and used at 1:500 
dilution. Mouse anti-DNA-PKcs Ab-4 (Cocktail) monoclonal antibody (MS423P) was purchased from ThermoFisher and used at 
1:100 dilution. Alexa Fluor 488 conjugated goat anti-Rabbit secondary antibody (cat. A-11008, lot. 1937184), Cyanine3 
conjugated goat anti-mouse secondary antibody (cat. A10521, lot. 1126597), and Alexa Fluor 594 conjugated goat anti-Rabbit 
secondary antibody (cat. A-11012, lot. 1745478) were purchased from Invitrogen and used at 1:500 dilution. 


=) 
jad) 
i 
(Ss 
= 
o 
= 
o 
Za) 
© 
je’) 
= 
a) 
=r 
= 
io 
19. 
fo) 
= 
= 
=) 
a 
Za) 
(SE 
S 
=: 
red) 
< 


Validation All antibodies are extensively used and well accepted commercial antibodies and validated in several published manuscript. 
Antibodies were further validated in our lab by western blotting appropriate cell lysates and assessing the resulting western blot 
for signal where the band of the expected molecular weight of the target protein would constitute at least 80% of the total signal 
of that lane. In some cases we performed IP mass spec to further validate the specificity of the antibody. FACS antibodies were 
validated by single staining on positive population from WT mouse, and Immunofluorescence antibodies were validated by 
either comparative staining of WT cell line to corresponding knock out cell line, or by inhibitor treatment, for intensity and 
position of positive signal. See methods for details. Additionally, our in vitro kinase assay with in vitro purified humans proteins 
further validate the specificity of our DNA-PK, Ku86 and p53 antibodies. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) HeLa and IMR90 were purchased from ATCC and Sigma, respectively. U2OS cells were purchased from ATCC. Most MEFs, ES 
cells and v-abl kinase transformed B cells were freshly derived from genetically modified animal models in the Zha lab, unless 
noted below. Standard protocols (can by found in references) were used to derived those cell lines. Their genotypes were 
confirmed and validated via PCR. More than 2, often more than 3 independently derived cell lines of each genotypes were 
used for each measurement. A few control cells were generously provided by colleagues. Specifically, the DNA-PKcs-/- 
Abeslon lines and the KU70-/- ES cells were originally derived in the laboratory of Dr. Frederick Alt at the Children's Hospital 
at Boston. The DNA-PKcs3A/3A Abelson lines were generously provided by Dr. Barry Sleckman then at Cornell University and 
now at the University of Alabama. Fand2-/- MEFs were generously provided by Dr. Agata Smogorzewska at the Rockefeller 
University. The genotype of all these cell lines have been independently verified by PCR in the Zha lab to the best of our 
ability. 


Authentication Commercial available cell lines were authenticated by the vendor, we have not re-authenticated the cells since we bought 
them. Investigator derived cell lines were validated by PCR genotyping to the best of our ability. 


Mycoplasma contamination All cell lines are mycoplasma free. We routinely test for mycoplasma contamination. 


Commonly misidentified lines None of the cell lines used in this study are in the database of commonly misidentified cell lines. 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals This study includes experiments conducted on house mice (Mus musculus) of either pure 129S1/SvlmJ strain, or 129S1/SvlmJ 
and C57BL/6J mixed strain, and of both male and female. No gender bias were noted. All analysis, unless otherwise noted (e.g., 
tumor and survivial cohorts), was conducted on mice of about 2 weeks old, or E14.5 embryos. 


Wild animals The study did not involve wild animals 


Field-collected samples The study did not involve samples collected from the field 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 
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ethodology 

Sample preparation Bone marrow cells were collected by flushing femora and tibias stems of both lower limbs, spleen cells were collected by 
mincing the whole spleen with frosted slides, treating with ACK buffer to get rid of red blood cells unless performing peripheral 
whole blood compartment studies, and filtered with Falcon 70 micrometer nylon cell strainer. E14.5 fetal liver cells were 
collected by separating fetal livers from E14.5 feti, and further mincing the livers with frosted slides. All population staining was 
conducted directly on live cells, and translation studies were conducted by labeling live cells with O-Propargyl-puromycin in RPMI 
1640 medium containing 15% FBS for 1 hr, and then samples were fixed with 4% paraformaldehyde, permeabilized with 0.1% 
saponin before conducting CLICK-IT assay according to manufacture protocol. 

Instrument All FACS data was collected either on LSR II flow cytometry system from BD, or on FACSCalibur flow cytometry system from BD. 

Software FACS data conducted on LSR II was collected using BD FACSDiva software, and FACS data conducted on FACSCalibur was 
collected using BD CellQuest Pro. All FACS data was further gated and analyzed using Flowjo V10. 


Cell population abundance No cell sorting was conducted in this study. 


Gating strategy All gating were conducted following well accepted gating strategies of corresponding antibodies staining from published studies 
before. On preliminary FSC/SSC plot, gates were set on gaps between debris, dead cells and live cells. And all live cells were 
included in further analysis. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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After activation by an agonist, G-protein-coupled receptors (GPCRs) recruit 
B-arrestin, which desensitizes heterotrimeric G-protein signalling and promotes 


receptor endocytosis’. Additionally, B-arrestin directly regulates many cell signalling 
pathways that can induce cellular responses distinct from that of G proteins’. In 
contrast to G proteins, for which there are many high-resolution structures in 
complex with GPCRs, the molecular mechanisms underlying the interaction of 
B-arrestin with GPCRs are much less understood. Here we present acryo-electron 
microscopy structure of B-arrestin 1 (Barrl) incomplex with M2 muscarinic receptor 
(M2R) reconstituted in lipid nanodiscs. The M2R-PBarrl complex displays a 
multimodal network of flexible interactions, including binding of the N domain of 
Barrl to phosphorylated receptor residues and insertion of the finger loop of Barrl 
into the M2R seven-transmembrane bundle, which adopts a conformation similar to 
that inthe M2R-heterotrimeric G, protein complex’. Moreover, the cryo-electron 
microscopy map reveals that the C-edge of Barrl engages the lipid bilayer. Through 
atomistic simulations and biophysical, biochemical and cellular assays, we show that 
the C-edge is critical for stable complex formation, Barrl recruitment, receptor 
internalization, and desensitization of G-protein activation. Taken together, these 
data suggest that the cooperative interactions of B-arrestin with both the receptor 
and the phospholipid bilayer contribute to its functional versatility. 


Activation of GPCRs leads to heterotrimeric G-protein-mediated sig- 
nalling that quickly returns to basal levels'. This remarkably conserved 
process of GPCR ‘desensitization’ (Fig. 1a) is mainly orchestrated by 
two small families of proteins, GPCR kinases and arrestins (reviewed 
in ref. *). GPCR kinases phosphorylate agonist-bound GPCRs on their 
carboxyl (C) terminus or intracellular loops (ICLs), leading to the 
recruitment of arrestin (Fig. 1a). In humans, visual arrestin (arres- 
tin-1) and X-arrestin (arrestin-4) are selectively expressed inthe retina, 
whereas the ubiquitously expressed B-arrestins 1 and 2 (also known 
as arrestin-2 and arrestin-3, respectively) regulate the hundreds of 
GPCRs that are found elsewhere. Arrestins comprise juxtaposed N- and 
C-terminal seven-stranded B-sandwich domains with a central crest 
of three loops (finger, middle and C-loops)**. After the phosphoryl- 
ated GPCR C terminus engages the N domain of arrestin’®, conforma- 
tional changes promote the binding of central crest elements to the 
receptor seven-transmembrane (7TM) bundle, which sterically 
blocks the coupling of G proteins°. B-arrestins also act as adaptors for 


endocytic machinery, thereby increasing receptor internalization’. 
As well as modulating desensitization, B-arrestins potentiate many 
signalling pathways independently of G proteins”. Notably, certain 
‘biased’ GPCR agonists preferentially activate G-protein or B-arrestin 
pathways; this could be exploited therapeutically to obtain moreselec- 
tive drugs??°. 

Numerous high-resolution structures of GPCR-G protein complexes 
have been obtained, primarily by cryo-electron microscopy (cryo-EM) 
(reviewed in ref. "). However, to our knowledge, the only GPCR-arrestin 
structure reported so far is acrystal structure of rhodopsin fusedtoa 
constitutively active mutant of visual arrestin®””. Therefore, knowledge 
of the molecular framework of GPCR-arrestin interactions remains 
limited—especially for the B-arrestins, which modulate the vast major- 
ity of GPCRs. Here we report the cryo-EM structure of Barrlin complex 
with M2R in high-density lipoprotein (HDL) particles (lipid nanodiscs) 
that mimic a native membrane. This structure provides new insights 
into B-arrestin-mediated GPCR desensitization and signalling, and 
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Fig. 1| Recruitment of Barrl by M2Rin anative lipid environment. a, Ligand 
(L)-induced conformational changes in GPCRs lead to heterotrimeric G-protein 
activation (GTP hydrolysis) and subsequent GPCR-kinase-mediated receptor 
phosphorylation. The initial binding of B-arrestin (Barr) to phosphorylated 
receptors leads to its coupling to the transmembrane (TM) bundle, sterically 
occluding further G-protein binding. b, Barrl allosterically enhances the 
affinity of iperoxo to HDL-M2Rpp but not to DDM-M2Rpp, as determined by 
competition radioligand binding assays. The positive allosteric modulator 
LY211960 (LY211) enhanced iperoxo affinity regardless of the reconstitution 
environment. Data are the mean of three independent experiments witherror 
bars representing s.e.m.* indicates significance compared to control (Ctl) 
(one-way ANOVA). logIC,;. values: DDM-Ctl, -7.10 + 0.09; DDM-Barrl, 


highlights the importance of the membrane environment in these 
processes. 


Structural determination of M2R-Barrl1 


Thestabilization of GPCR-B-arrestin complexes has been historically 
difficult because of both the need for receptor phosphorylation and 
the low strength of the B-arrestin-7TM interaction. To reconstitute a 
functional complex we chose M2R, a family A GPCR that regulates car- 
diac function”, because Barrl has a relatively strong interaction with its 
7TM core“. To ensure homogeneous phosphorylation, we used sortase 
to enzymatically ligate asynthetic phosphopeptide (pp) derived from 
the C terminus of vasopressin 2 receptor (V2R) onto the Cterminus of 
M2R, forming M2Rpp“™ (Extended Data Fig. 1a). Although wild-type M2R 
lacks aC terminus with phosphorylatable residues and is phosphoryl- 
ated within ICL3, Barrl enhances the affinity of agonists for M2Rpp to 
the same extent as for natively phosphorylated M2R™”. To enhance 
thestability of Barrl we generated a minimal cysteine variant truncated 
at residue 393 (Barrl(MC-393)), which displayed the same allosteric 
coupling to M2Ras did the wild type (Extended Data Fig. 1b). Hereafter, 
Barrlrefers to this variant unless otherwise specified. 
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-7.03 + 0.06; DDM-LY211, -8.36 + 0.04 (P< 0.0001); HDL-Ctl, -7.49 + 0.08; HDL- 
Barrl -8.29 + 0.08 (P< 0.0007); HDL-LY211, -9.17 + 0.05 (P< 0.0001).c, HDL- 
M2Rpp, but not DDM-M2Rpp, enhances the fluorescence of bimane in the 
finger loop of Barrl (red star, inset). Curves represent the difference in spectra 
obtained with an antagonist (atropine) and an agonist (iperoxo). Data 
represent the means of three independent experiments. d, Orthogonal views 
of the cryo-EM density map of the HDL-M2Rpp-farrl complex coloured by 
subunit (orange, M2Rpp; teal, Barrl; grey/white, HDL particle). The orange- 
coloured density on Barrl corresponds to the phosphorylated C-terminal 
peptide (V2Rpp) ligated tothe receptor. The nanodisc density, omitted in the 
middle panel for clarity, was generated earlier in image processing before high- 
resolution refinement of the M2Rpp-farrl complex. 


Initially we characterized the ability of Barrl to bind M2Rpp in 
n-dodecyl-B-D-maltoside (DDM-M2Rpp) detergent. Barrl and the anti- 
body fragment Fab30, which stabilizes the Barrl-V2Rpp interaction, 
co-immunoprecipitate with the receptor only whenitis phosphorylated 
(Extended Data Fig. 1c). Because Barrl binding to the 7TM core of the 
receptor enhances agonist binding», we monitored this interaction 
using radioligand competition binding assays to measure agonist affin- 
ity. The affinity of the agonist iperoxo for DDM-M2Rpp is increased by 
the positive allosteric modulator LY2119620 but not by Barrl (Fig. 1b). By 
contrast, when M2Rpp is reconstituted into nanodiscs (HDL-M2Rpp)— 
which providea more physiological-like lipid bilayer—both LY2119620 
and arrlincrease the affinity of iperoxo for the receptor (Fig. 1b). The 
importance of the lipid bilayer in Barrl-receptor binding was further 
tested by labelling the Barrl finger loop—which inserts into the 7TM 
core—with the probe monobromobimane (mBr), the fluorescence of 
whichincreases in hydrophobic environments. For these experiments 
we used alabelled mutant witha valine-to-cysteine substitution at posi- 
tion 70 (V70C); this variant was shown to still couple to the 7TM core 
of M2Rpp by radioligand binding experiments (Extended Data Fig. 1b). 
We observed an agonist-dependent increase in the fluorescence of 
bimane with HDL-M2Rpp but not with DDM-M2Rpp (Fig. 1c, Extended 


d Wer Qarrl) = M2R-G, 


M2R-farr1 
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Data Fig. 1d), which suggests that Barrl interacts with the 7TM core of 
M2Rpp only ina lipid bilayer”. 

We optimized M2Rpp-farrl complexes for cryo-EM using radioli- 
gand competition binding experiments and negative-stain electron 
microscopy”. The final complex—iperoxo/LY2119620-liganded M2Rpp 
in MSP1D1E3 lipid nanodiscs, bound to Barrl and the antibody fragment 
Fab30—was monodisperse and displayed an affinity for the agonist that 
was 350-fold greater than that of the receptor alone (Extended Data 
Fig. le, f). Fab30 was included to stabilize the Barrl-V2Rpp interaction 
and to aid incryo-EM particle alignment’. We chose MSP1D1E3 nano- 
discs with a diameter of 12 nm, because preliminary cryo-EM analysis 
of an M2Rpp-fBarrl complex in smaller MSPIDHS nanodiscs (around 
9-nm diameter) indicated a mixture of ‘hanging’ conformations, in 
which Barrl interacted only with V2Rpp, and ‘core’ conformations, in 
which farrl also engaged the 7TM bundle” (Extended Data Fig. 1g, h). 
We previously observed such conformations in our electron micros- 
copy analysis of complexes formed by Barrl and the B,-adrenergic 
receptor!’. Low-resolution cryo-EM maps of MSP1DH5 M2Rpp-farrl 
indicated conformational variability even among particles adopting 
the core interaction, with Barrl ‘rocking’ with respect to M2Rpp and 
the Barrl C-edge occasionally approaching the lipid bilayer (Extended 
Data Fig. 1h). Given that hydrophobic residues in the C domain of visual 
arrestin interact with the membrane”, we proposed that the extended 
lipid surface of larger MSP1D1E3 nanodiscs could stabilize complexes 
for studies at higher resolution. 

In complexes in 12-nm nanodiscs, the interactions of Barrl with the 
receptor 7TMcoreand the lipid surface were more uniform (Extended 
Data Fig. 1i). Nevertheless, variability in the size and tilt of the lipid 
nanodisc relative to the Barrl-Fab30 density presented a challenge 
for structure determination. Low-resolution analysis confirmed small 
differences among particle classes in the angle of Barrl relative to the 
receptor 7TM, indicating inherent flexibility (Extended Data Fig. 2a). 
Through three-dimensional (3D) classification we identified a class of 
approximately 145,000 particle projections (17.4% of 831,443 particles 


Fig. 2| Structure of the M2Rpp-farrl complex. 
a-c, Orthogonal views of the M2Rpp-farrl 
structure coloured by subunit (orange, M2Rpp; 
teal, Barrl; grey sticks, model lipid bilayer). 

d, Superposed structures of M2Rpp-farrl and 
M2R-G, complexes (M2R, grey; G,, Ras domain, 
pink; PDB: 6OIK) aligned by M2R.G, By subunits 
are omitted for clarity. The top rightimage shows 
thecytoplasmic view of M2Rtransmembrane 
alignment in the absence of transducers. The 
bottom right inset shows the similar insertion 
depth of the finger loop of Barrl and the a5-helix 
ofG,intothe TM bundle of M2R. e, Superposition 
of M2Rpp-farrland Rho-Arrl structures 
(rhodopsin, grey; arrestin1, green; PDB: 4ZW)). 
The inset shows an enlarged view of the finger 
loops of Barrl (M2Rpp) and visual arrestin 
(rhodopsin). 
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with a well-defined Barrl—-Fab30 region) that showed solid density 
for the 7TM portion in one position. Other 3D classes lacked defined 
receptor density, possibly due to a combination of specimen distor- 
tion at the air/water interface during cryo-sample preparation”° and 
potential projection misalignments due to the large nanodiscs. 3D 
refinement of the well-defined class enabled us to obtain a 3D map of 
the M2Rpp-farrl complex with a global indicated resolution of 4A 
(Fig. 1d, Extended Data Fig. 2b). This map showed relatively lower reso- 
lution in the 7TM region, including poor density of the highly mobile 
transmembrane helix 1 (TM1), but nevertheless enabled a confident 
comparison with M2R active-state structures. Subsequent refinement 
focused on Barrl and its interface with M2Rpp, and yielded a map of 
3.6 Aresolution displaying well-resolved features, which were used for 
model building in this region (Extended Data Fig. 3). 


Topography of the M2R-farr1 complex 

The cryo-EM structure of the M2Rpp-Barrl complex in nanodiscs 
reveals a multimodal interaction network, in which the N domain of 
Barrl engages the phosphorylated C terminus of the receptor, the inter- 
domain loops of Barrl engage the 7TM core and ICL2 of the receptor, 
andthe C domain of Barrl engages the phospholipid bilayer (Fig. 2a—c). 
Topologically, the central crest region interacting with the 7TM bun- 
dle occupies a similar position to the Ras domain of G proteins. Com- 
parison of the structure of M2Rpp-farrl with our recent structure of 
the M2R-G, complex (PDB: 6OIK) reveals that the 7TM bundle of the 
receptor adopts similar active-state configurations in both complexes, 
characterized by the opening of the TM6 cytoplasmic portion? (Fig. 2d). 
Relatively small shifts in the cytoplasmic ends of TMS, TM6and TM7 are 
probably requiredto accommodate Barrl, although we cannot exclude 
effects arising from the use of lipidic (M2Rpp-Barrl) compared with 
detergent (M2R-G,) environments. The similarity of the transducer- 
bound conformations of M2R inthe two structures is consistent with 
the similar allosteric enhancement of agonist binding to M2R by Barrl 
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Fig. 3 | Regions ofinteraction between M,R and Barrl. The M2Rpp-Barrl 
complex, with dashed boxes indicating the maininteraction sites between 
Barrl (teal) and receptor (orange). a, Enlarged view of the finger loop of Barrl 
inserting into the TM bundle of M2R. The model includes side-chain positions 
from PHENIX refinement with OPLS3e electrostatics. b, Enlarged view of the N 
domain of Barrl bound to the C terminus of phosphorylated M2Rpp. 


and G protein". In M2R-G,, opening of the cytoplasmic portion of 
TM6 accommodates the a5-helix of G,. Similarly, displacement of 
TM6 accommodates the finger loop of Barrl1, which occupies a simi- 
lar position to the a5-helix of the Ras domain of G,, albeit at aslightly 
lower depth and involving a smaller interaction surface (Fig. 2d). This 
difference probably accounts for the lower affinity and inherent con- 
formational variability of the Barrl-7TM interaction compared with 
the M2R-G, interaction. 

In the cryo-EM structure of M2Rpp-farrl and the crystal structure 
of rhodopsin-visual arrestin (denoted Rho-Arr1); PDB: SWOP*®, the 
arrestins show notable differences despite their similar orientations 
relative to the 7TM bundles (Fig. 2e). Barr is tilted approximately 7° 
further towards the membrane (Extended Data Fig. 4a)—thisis probably 
due to the interaction of the C domain with the nanodisc phospholip- 
ids, although crystal packing interactions inthe structure of Rho-Arrl 
might also contribute to these differences. In addition, the finger loops 
display differences in both positioning and structure. Whereas the 
finger loop of visual arrestin is modelled with an a-helical segment 
(residues E71-G77), the finger loop of Barrl adopts an extended loop 
configuration. The other arrestin interdomain loops adopt similar 
conformations in these two GPCR-arrestin structures (Extended Data 
Fig. 4b). 


Interaction of Barrl with M2R 

Barrl interacts with M2Rpp through interfaces with the phosphoryl- 
ated C terminus, the 7TM core, and ICL2 of the receptor (Fig. 3). The 
phosphopeptide-farrl interaction is essentially identical to that in 
our crystal structure of V2Rpp-fBarr1-Fab30’, and six phosphoryl- 
ated residues are well resolved in the 3.6 A map. The peptide binds to 
a positively charged crevice on the N domain by displacing the C ter- 
minus of Barrl, which destabilizes the arrestin polar core and enables 
the gate loop to flip towards the N domain. A critical phosphorylated 
residue on the C terminus of the receptor, T491 (T360in V2Rpp), estab- 
lishes interactions with R25 (N domain) and K294 (gate loop) of Barr1, 
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c, Expanded view of the interaction between ICL2 of M2R (ribbon) anda 
hydrophobic cleftin Barrl (rendered as electrostatic surface; red, blueand 
white graduations indicate negative, positive and neutral surface potential, 
respectively). The mesh in all panels depicts the 3.6 A cryo-EM map contoured 
at o= 3.0 (where ois the root meansquare of the electron density in the unit 
cell) witha masked 2.0 Azonearound the atoms depicted. 


stabilizing an activated conformation characterized by interdomain 
twisting” (Fig. 3a). The fused V2Rpp is 36 residues long and provides 
ample flexibility so as not to limit the orientation of arrestin; this is also 
evident by the lack of observed density for the 23 residues that do not 
engage arrestin. Similarly, although we cannot rule out the possibil- 
ity that natively phosphorylated ICL3 residues influence the relative 
orientation of Barr1, this seems unlikely given the length of this loop 
(152 residues) and its lack of order in our structure. 

Thesecond interface between the finger loop of Barrl and the 7TM 
bundle of the receptor involves both hydrophobic and electrostatic 
interactions (Fig. 3b). The C-terminal finger loop region—including 
residues L68, V70, L71and F75—is hydrophobically packed against the 
side of the receptor pocket formed by TM3, TMS, TM6 and ICL2. Onthe 
opposite side, the map of the N-terminal part of the finger loop displays 
partial side-chain densities of ionic residues R65, E66 and D67, reveal- 
ing both their positions and their relative mobilities. A potential salt 
bridge between R65 (finger loop) and D135 (middle loop) of Barrl could 
stabilize the conformation of the finger loop, whereas E66—although 
resolved only through CB—points towards R62 of the finger loop and 
K138 of the middle loop (Extended Data Fig. 5a). At the tip of the fin- 
ger loop, D69 seems to be positioned to form hydrogen bonds or salt 
bridges with several receptor residues, including N58”*? and R121?*° 
of the highly conserved DRY motif (superscripts denote Ballesteros- 
Weinstein numbering for GPCRs””). These interactions formed consist- 
ently across multiple, independent molecular dynamics simulations, 
suggesting that this interaction network stabilizes the M2Rpp-Barrl 
complex (Extended Data Fig. 5b, c). Considering that R*°°-E°*° salt 
bridges stabilize the inactive receptor state of M2R and other Class A 
GPCRs, Barrl may stabilize an active receptor partly by engaging R®°°. 
The mutant Barrl(D69A) displays significantly reduced coupling to 
M2Rpp, which supports this hypothesis (Extended Data Fig. 5d). 

The finger-loop fold, the interdomain twist, and the binding mode 
through the C terminus of V2Rpp that we observe here for Barrl 
are very similar to those in the crystal structure of the active Barrl- 
V2Rpp-Fab30’ (PDB: 4JQI, Extended Data Fig. 4c). However, the crystal 
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Fig. 4| Anchoring of the C domain of Barr1 to the lipid membrane. 

a,b, Atomistic simulations of Barrl in complex with M2Rpp (in the absence of 
Fab30) to investigate the position of the C-edge of Barrl (a) and the Barrl 
interdomain twist angle in the absence (dark blue) or presence (cyan) of alipid 
membrane (dashed black line in a) (b).c, HDL-M2Rpp, but not HDL-M2R, 
enhances the fluorescence of bimane at the C-edge of Barrl (inset, red star) 
independent of the presence of antagonist (atropine) or agonist (iperoxo). 

d, The ability ofiperoxo-activated HDL-M2Rpp to increase the fluorescence of 
bimane in the finger loop of Barrl (inset, red star) requires receptor 
phosphorylation (compare with HDL-M2R) and is markedly reduced by the 
C-edge 3xD mutations (L335D/L338D/S34 0D). Data are representative of three 
independent experiments. 


structure shows a large twist in the orientation of the finger loop, due 
to the N-terminal segment of the phosphopeptide inducing a twist in 
the flanking B-strands S5 and S6. This V2Rpp segment must disengage 
before the finger loop can be inserted into M2Rpp and is not ordered 
in our map. The positioning of the N-terminal V2Rpp segment in the 
crystal structure could be due to lattice packing. Alternatively, these 
differences could reflect a multi-step binding mechanism, in which 
the initial binding of Barr1 to the full length of the phosphopeptide 
is followed by release of the N terminus of the phosphopeptide, flip- 
ping of the Barrl finger loop, and burial of the hydrophobic residues 
of the finger loop in the intracellular cavity of the receptor (Extended 
Data Fig. 6a). 

The M2Rpp ICL2 rests in a cleft between the N- and C domains of 
Barrl, which comprises portions of the finger, middle, gate and C-loops. 
Although this portion of the map lacks well-resolved side-chain fea- 
tures, the density suggests that ICL2 adopts the same helical conforma- 
tionasinM2R-G, (Fig. 3c). This ICL2 orientation places L129***" facing 
a Barrl hydrophobic cleft that is capable of accommodating even larger 
side chains (Extended Data Fig. 6b). Although leucine is most prevalent 
at this positionin other class AGPCRs that couple to B-arrestins, larger 
hydrophobic amino acids (suchas methionine and phenylalanine) are 
common (Extended Data Fig. 6c), suggesting one source of the promis- 
cuity of B-arrestins. Notably, the same hydrophobic ICL2 residue often 
engages in van der Waals interactions with G proteins?” >, 


Membrane interaction of Barrl Cdomain 


The cryo-EM maps reveal that the C-edge of Barrl interacts with the 
nanodisc, with one C-edge loop contacting the membrane (loop 1, 
residues 191-196, LMSDKP) anda second loop burying itself inthe lipid 
bilayer (loop 2, residues 330-340, SRGGLLGDLAS) (Extended Data 
Fig. 7). Interaction of the C-edge of arrestin with the membrane has 


been suggested by the structure of Rho-Arr1, and by site-specifically 
labelling the C-edge of visual arrestin with environmentally sensitive 
probes”. Given that the affinity of Barrl for the 7TM core of M2Rppis 
higher in nanodiscs than in detergent (Fig. 1b, c), we proposed that there 
are functional consequences to the C-edge-lipid interaction. First, we 
performed multiple, independent molecular dynamics simulations” 
with M2Rpp-farrlinitially embedded in either acontinuous membrane 
(enabling C-edge interaction) ora 9-nm-diameter nanodisc (too small 
for C-edge interaction) (Fig. 4a, Extended Data Fig. 8a). In four of the 
five simulations using the full membrane, Barrl remained in its active 
conformation with an average interdomain twist angle of around 17°— 
similar to the interdomain twist observed in the crystal structure of 
the Barrl-V2Rpp-Fab30 complex’. However, insmall nanodiscs, Barr 
adopted a more heterogeneous conformational ensemble dominated 
by inactive conformations (with aninterdomain twist angle of between 
0° and 7°) (Fig. 4b); this suggests that the interaction between the 
C-edge andthe membrane stabilizes the active conformation of Barr1. 

We site-specifically labelled C-edge loop 2 with mBr (L338mBr) to 
assess whether the Barrl-lipid interaction requires the binding of Barrl 
tothe M2Rpp phosphorylated tail or the 7TM bundle”. An increasein 
fluorescence of Barr1 L338mBr, which reflects insertion of the C-edge 
into a hydrophobic environment, requires M2Rpp phosphorylation 
and nanodisc reconstitution (Fig. 4c, Extended Data Fig. 8b). This is 
consistent with previous observations that the engagement of visual 
arrestin with membranes requires rhodopsin phosphorylation”. How- 
ever, the change in Barr1L338mBr fluorescence is not agonist (iperoxo)- 
dependent, whichindicates that the C-edge-lipid interaction does not 
require interaction between Barrl and the 7TM bundle. Because the 
C-edge-membrane interaction is observed in nanodiscs but not in 
detergent (Fig. 1b, c), we suggested that it might facilitate Barr1-7TM 
coupling. We sought totest this by attenuating the hydrophobicity of 
the C-edge with aspartic acid substitutions at L335, L338 and S340 of 
Barrl (Barr1(3xD))”. Fluorescence of Barrlin which the finger loop was 
labelled with bimane (Fig. 4d, Extended Data Fig. 8c) confirmed that 
the 3xD mutation markedly reduced the interaction of Barrl with the 
7TM bundle of HDL-M2Rpp. Thus, maximal coupling of Barrl to the 
7TM bundle of M2Rpp requires not only receptor phosphorylation but 
also interaction of the C-edge of Barrl with the phospholipid bilayer. 

Furthermore, we investigated whether the interaction between the 
C-edge of Barrlandthe membrane modulates Barrl-mediated receptor 
desensitization and internalization. We measured the ability of purified 
wild-type Barrl and Barrl(3xD) to inhibit (desensitize) HDL-M2Rpp- 
mediated G-protein activation in vitro. Iperoxo-induced activation of 
HDL-M2Rpp increases the GTPase activity of purified heterotrimeric G;. 
Whereas wild-type Barrl blocks 50% of G, activity, Barrl(3xD) has almost 
no effect (Fig. 5a). Wethen compared the ability of GFP-tagged wild-type 
Barrl (GFP- Barrl(WT)) and GFP-Barrl1(3xD) to induce the internaliza- 
tion of Flag-tagged M2R (Flag—M2R) in Barrl1/Barr2-null HEK293 cells. 
After 5 minutes of iperoxo stimulation, GFP-Barr1(WT) translocates 
from the cytoplasm to the plasma membrane. After 30 minutes, both 
GFP-Barrl(WT) and Flag—-M2R (morethan 40%) are largely internalized 
(Fig. 5b, c, Extended Data Fig. 9a, b). By contrast, GFP-Barrl(3xD)— 
which is expressed at similar levels to the wild-type (Extended Data 
Fig. 9c, d)—remains in the cytoplasm even after 30 minutes, with little 
Flag—M2R internalization (10%). Notably, the impaired recruitment 
of GFP-Barr1(3xD) to wild-type M2R and V2R (Extended data Fig. 9e) 
shows that membrane anchoring contributes to the recruitment of Barrl 
to natively phosphorylated GPCRs, not only the engineered M2Rpp 
construct. Thus, the interaction of Barrl with the membraneis critical 
for desensitization and internalization of the receptor. 


Three-siteinteraction model 


The binding of arrestin to GPCRs has been viewed as a two-part process: 
binding to the phosphorylated ICLs or the C terminus of the receptor 
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Fig. 5| The functionality of Barrl depends on C domain-lipid interactions. 
a, Activation of purified heterotrimeric G, protein by iperoxo-stimulated HDL- 
M2Rppin vitro is reduced by wild-type Barrl but not by Barrl(3xD). Data are the 
mean of three independent experiments with error bars representing s.e.m. 
*P<0.0001, one-way ANOVA compared towild-type Barrl plus iperoxo. 

b, Iperoxo stimulation of Flag—M2R causes plasma membrane recruitment of 
GFP-Barrl(WT) and subsequent receptor internalization, whichis impaired by 
the 3xD mutations, as demonstrated by confocal microscopy. Barrl, nucleiand 
M2Rarecoloured green, blue, and red, respectively. Confocal microscopy 
images are representative of three independent experiments. c, Quantification 
of Flag-M2R internalization by flow cytometry insame cells as b. Dataare the 
mean of three independent experiments with error bars representing s.e.m. 
*P=0.0008, unpaired two-sided t-test compared to wild-type Barrl. 


induces conformational changes in B-arrestin, which subsequently 
promotes its coupling to the 7TM bundle of the receptor. Our find- 
ings, supported by observations of the Rho-Arrl complex” and the 
NTSR1-Barrl complex”®, expand this model to include a critical Barrl- 
lipid interaction (Extended Data Fig. 9f). In this three-site interaction 
model, the recruitment of arrestin requires both GPCR phosphorylation 
andaninteraction of Barrl with the plasma membrane (Fig. 5b), which 
could increase the arrestin concentration at the cell membrane before 
receptor activation. The C-edge-membrane interaction subsequently 
enhances the binding of arrestin to the 7TM bundle (Fig. 4d) and the 
desensitization of G-protein activation (Fig. 5a). 

Whereas membrane anchoring may be a conserved property of 
arrestins, the mechanisms that underlie membrane association could 
be distinct. The C-edge loop buried in the nanodisc is found in visual 
arrestin andthe dominant form of Barr1, but not in Barr2 or analternative 
Barrl splice variant. The binding site of phosphatidylinositol-4,5-bis- 
phosphate (PtdIns(4,5)P,)—a phospholipid observed inthe structure of 
the NTSR1-farrl complex, described ina related study”°—is conserved 
in B-arrestins but not in retinal arrestins”°?””5 (Extended Data Fig. 7). 
Accordingly, the variability of membrane compositions could provide 
yet another level of regulation for GPCR desensitization, internalization 
and signalling. Notably, Barr2 can stimulate MAP kinase signalling from 
clathrin-coated structures after dissociation from activated GPCRs”. 
These and other recent data suggest that the lipid membrane stabilizes 
the activation of B-arrestin when bound to, and even after dissocia- 
tion from, the receptor’ *1, Thus, as is evident from the structures 
of Barrl bound to M2Rpp or NTSR1”, a complex cooperative network 
of low-affinity interactions involving both receptor and membrane 
phospholipids endowarrestins with the necessary plasticity to variably 
couple to hundreds of GPCRs. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomizedand the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Receptor expression and purification 

M2R containing an N-terminal Flag tag, C-terminal sortase ligation 
consensus sequence (LPETGGH), and 6xHis-tag was cloned into pcDNA- 
zeo-teto and stably expressed in tetracycline-inducible Expi293F cells 
(Invitrogen)“. Cells were not authenticated or routinely tested for 
mycoplasma. Cells were grown toa density of 4-5 million cells per ml 
and then treated with Expi293F expression enhancers (Thermo Fisher), 
5puMatropine, and 5 uMkifunensine to restrict glycosylation. Expres- 
sion was induced 18 h thereafter by addition of 4 pg ml™ doxycycline 
and 5 mM sodium butyrate. Cells were collected 48 h post-induction 
and stored at -80 °C until further processing. Flag-M2R-LPETGG-His6 
was purified as previously described”, with all purification steps con- 
ducted at 4 °C with protease inhibitors (benzamidine and leupeptin) 
unless stated otherwise. In brief, cells werelysed for 30 min by stirring 
in10 ml g ‘wet cell mass of lysis buffer (10 mM Tris (pH 7.4), 2 mMEDTA, 
and10 mMMgCL, 5 units per ml benzonase, 5 pM atropine, and 2 mg 
ml iodoacetamide). Membrane was pelleted at 30,000g for 20 minand 
resuspended in10 ml g‘ original cell pellet mass of solubilization buffer 
(20 mM HEPES (pH 7.4), 750 mM NaCl, 1% n-dodecyl-B-D-maltoside 
(DDM), 0.05% cholesterol hemisuccinate, 10% glycerol, 5 units per 
ml benzonase, 5 uM atropine, and 2 mg mI iodoacetamide). After 
extensive homogenization using a Dounce homogenizer, solubilizing 
membrane was sequentially stirred at room temperature and 4 °Cforl 
heach. Insoluble material was removed by centrifugation at 30,000g 
for 30 min, and the supernatant was loaded onto M1-Flag resin with 
2 mM CaCl, at 1-3 ml min”. M1-Flag resin was washed with 5 column 
volumes each of wash buffer (20 mM HEPES (pH 7.4), 0.1% DDM, 0.01% 
CHS, 2 mM CaCl, and 1M atropine) containing high (750 mM) and 
low (100 mM) NaCl at ratios of 4:0, 3:1, 2:2, 1:3 and 0:4, respectively. 
Receptor was eluted in elution buffer (20 mM HEPES (pH 7.4), 100 mM 
NaCl, 0.1% DDM, 0.01% CHS, 0.2 mg mI“ Flag- peptide, 1 pM atropine, 
and 5 mM EDTA), and glycosylation was removed by incubation with 
a 1:10 protein ratio of EndoH to M2R for 90 min at room temperature. 
The enzyme sortase was used to ligate the synthetic phosphopeptide 
GGG-V2Rpp (GGG-ARGRpTPPpSLGPQDEpSCp TpTApSpSpSLAKDTSS) 
onthe C terminus of M2R (M2Rpp) as previously described"*. Mono- 
meric receptor was collected by size-exclusion chromatography ona 
Superdex200 Increase column (GE Healthcare Life Sciences). 


HDL reconstitution 

DDM-M2R was pre-incubated with twofold molar excess atropine on 
ice for 30 min before reconstitution. Receptor (5 1M) was mixed with 
8 mM POPC/POPG lipids and 150 pM MSPIDIE3 onice for 1 hina final 
buffer composition of 20 mM HEPES (pH 7.4), 100 mM NaCl, and 0.5 mM 
EDTA. Detergent was removed using Bio-Beads (BioRad) (50 mg per 100 
pl reaction volume) and rotated overnight at 4 °C. The supernatant was 
separated from the Bio-Beads using a 28-gauge needle and was diluted 
with 20 mM HEPES (pH 7.5) and 100 mM NaCl (HN buffer) to obtain 
a final concentration of 2 uM HDL-M2R. HDL-M2R was rotated with 
M1-Flag resin for 30 min at room temperature and then an additional 
15 min after adding 2 mM CaCl,. M1-Flag resin was quickly washed in 
column format with five resin volumes of HN buffer with 2mM CaCl,. 
HDL-M2R was eluted with HN buffer containing 0.2 mg mI Flag peptide 
and5 mM EDTA. Size-exclusion chromatography was used to collect 
monomeric HDL-M2R and to remove Flag peptide. 


Parrl and Fab30 purification 
Toenhance expression and stability, aminimal cysteine (C59A, C125S, 
C1401, C150V, C242V, C251V and C269S) and truncated (after amino 


acid 393) variant of rat B-arrestin1 (Barrl-MC-393) in pGEX4T was 
generated. Barrl-MC-393 was expressed and purified as previously 
described**. In brief, GST-Barrl-MC-393 was expressed in BL21(DE3) 
bacteria, lysed by sonication, and captured using glutathione (GST) 
Sepharose. Barrl-MC-393 was cleaved from GST by thrombin digestion 
and further purified using HiTrap Q Sepharose anion exchange. Fab30 
was purified as described previously’. 


HDL-M2Rpp Barrl complex formation and purification 
HDL-M2Rpp was pre-incubated for 30 min onice with a fivefold molar 
excess of iperoxo and LY211960 and then 90 min at room temperature 
witha twofold molar excess of Barrl-MC-393 and Fab30. The complex of 
HDL-M2Rpp-farrl-Fab30 was separated from unbound Barrl-MC-393 
and Fab30 by M1-Flag chromatography as described above. After M1- 
Flag elution, the complex was subjected to size-exclusion chromatog- 
raphy ona Superdex200 Increase column with HN containing 1 1M 
iperoxo and 2 tMLY211960. Peak fractions were concentrated to obtain 
a final concentration of 2 mg mI for cryo-EM analysis. 


Co-immunoprecipitation 

DDM-M2R or DDM-M2Rpp (4 pg) was pre-incubated with iperoxo 
(10 1M) for 30 min onice before the addition of twofold molar excess 
purified Barrl-MC-393 and Fab30. Reactions were incubated at room 
temperature for 1h, rotated with M1-Flag resin for 20 min, washed 
with HN buffer containing 0.1% DDM, and eluted with HN buffer 
with 0.1% DDM, 0.4 mg ml Flag peptide and 10 mM EDTA. Proteins 
were separated by SDS-PAGE and visualized by Instant Blue stain 
(Expedeon) 


Radioligand binding assays 

Competition equilibrium binding experiments with DDM-M2R were 
conducted using ascintillation proximity assay (Perkin Elmer) ina final 
volume of 100 pl containing HN buffer with 0.1% DDMand1mg mI? 
BSA. DDM-M2Rpp was incubated for 1h at room temperature with 1 
uM Barrl or 10 pMLY2119620, 1 nM [2H]N-methylscopolamine (NMS), 
and varying concentrations of iperoxo. Yttrium silicate (YSi) proteinA 
beads (Perkin Elmer) coated with Flag-M1 antibody were subsequently 
added for 30 min, and YSi emission was read ona Wallac 1450 Microbeta 
Plus. Radioligand binding with HDL-M2R was conducted in HN with 
1mg mI BSA with the same final concentrations of components as 
above. Reactions proceeded for 90 min and were collected onto glass 
fibre filters (GF-B) soaked with 0.3% polyethyleneimine using a Brandel 
harvester. Data from competition radioligand binding assays were 
analysed using GraphPad software. 


Bimane fluorescence 

Purified rat Barr1-MC-393(V70C), Barrl-MC-393(L337C) and Barrl- 
MC-393(V70C/L334D/L337D/S339D) (3xD) were labelled overnight 
at 4 °C with athreefold molar excess of mBr (Sigma) and an additional 
threefold molar excess added for 1h at room temperature the next day. 
Reactions were quenched with L-cysteine, and free mBr was removed 
by size-exclusion chromatography. For bimane experiments, DDM- 
M2R or HDL-M2R was pre-incubated with 10 uM iperoxo or atropine 
and twofold molar excess Fab30 for 20 min at room temperaturein 
HN buffer containing 1 mg mI BSA. Bimane-labelled complexes mBr- 
Barrl1-MC-393(V70C), mBr-Barrl-MC-393(L337C) or mBr-Barr1-MC- 
393(V70C/3xD) were added to a final concentration of 200 nM witha 
1.5-fold molar excess of DDM- or HDL-M2R. For DDM-M2R, the buffer 
additionally contained 0.1% DDM and 0.01% CHS. Reactions were equili- 
brated for 30 minin black, solid-bottom 96-well microplates (Corning) 
before fluorescence emission spectra were collected ona CLARIOstar 
plate reader (BMG Labtech) in top-read mode, with excitation at 370 
nm (16-nm bandpass) and emission scanning from 400 nm to 600 nm 
(10-nm bandpass) in1-nm increments. Wells for background subtrac- 
tion contained all components except bimane-labelled Barrl-MC-393. 
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Statistics were conducted by comparing the area under the curves 
(GraphPad Prism) from at least three independent experiments. 


Cryo-EM data acquisition and data processing 

Before cryo-EM preparation, all samples were screened for quality 
using conventional negative-stain electron microscopy”. In preliminary 
cryo-EM experiments, M2Rpp-Barrl-Nb24-scFv30 (Nb24: camelid 
nanobody) complex inthe smaller (~-9-nm diameter) MSP1D1H5 nano- 
discs, at a concentration of 1mg ml", was applied to glow-discharged 
200-mesh grids (Quantifoil R1.2/1.3), and vitrified using a Vitrobot 
Mark IV (Thermo Fisher Scientific) at 4 °C and 100% humidity. Images 
were collected on a Titan Krios equipped with a Gatan K2 Summit 
direct electron camera, at anominal magnification of 59,000x with 
a pixel size of 0.86 A. A total of 3,589 movie stacks were acquired with 
dose-fractionation into 40 frames with an accumulated dose of 70 
electrons per A. Dose-fractionated stacks were subjected to beam- 
induced motion correction using MotionCor2*. Data processing was 
performed in RELION3.0*®. Particle projections (487,582) were selected 
by template-based autopicking and subjected to 2D classification. 
A subset of around 70,000 particles showing density for both HDL- 
receptor and Barrl were subjected to 3D classification. One out of six 
3D classes showed a ‘hanging’ arrestin conformation, and three classes 
showed a ‘core’ interaction between receptor and arrestin but with 
‘rocking’ motion of arrestin relative to the receptor-embedded nano- 
disc (Extended Data Fig. le, f). 

M2Rpp-farrl—-Fab30 complex in the larger (~12-nm diameter) 
MSPID1E3 nanodiscs was prepared as described above, except that 
cryo-EM images were collected on a Titan Krios equipped with the 
Gatan GIF Quantum LS Imaging energy filter using aGatan K2 Summit 
direct electron camerain counted mode, corresponding toa pixel size 
of 1.06 A. Each image was dose-fractionated into 40 frames witha dose 
rate of 7 electrons per pixel per second and total exposure time of 8 
s, resulting in an accumulated dose of 50 electrons per A2. Data were 
collected in seven independent sessions resulting in a total number 
of 30,454 movie stacks. Contrast transfer function parameters for 
each micrograph were determined by Gctf v.1.06*°. Auto-picking was 
performed with templates that were generated from manually picked 
particles. Particles were subsequently screened by 2D reference-free 
classification followed by 3D classification. Particle projections were 
pooled together after independent 3D classification of each dataset. 
A representative data-processing workflow is shown in Extended 
Data Fig. 2b. In 2D and 3D classification, around 25% of particle pro- 
jections showing well-defined density for complex components indi- 
cated an extra density connected to Fab30, which we suspect to be 
anon-specifically attached second Fab30. Because this domain was 
variable among complexes, a mask was applied to remove its density 
during 3D classification. Particles from each dataset contributing to 
3D reconstructions with well-defined features were merged with the 
other six subsets obtained ina similar strategy, resulting a total number 
of 831,443 projections with well-defined particle components. Owing 
to the variability in size and tilt of the large MSP1D1E3 nanodisc, which 
was dominating the projection alignment, the 831,443 particle projec- 
tions were subjected to a focused alignment on Barrl-Fab30 density 
followed by 3D classification without alignment while masking out 
the nanodisc density. One 3D class accounting for 145,618 particles 
showed a well-defined transmembrane portion, and its particle pro- 
jections were subjected to 3D refinement after subtracting from raw 
images the density of the nanodisc and the constant (CL/CH1) portion of 
Fab30 due to its relative flexibility, but leaving the scFv portion intact. 
This strategy resulted in a 3D reconstruction of the M2Rpp-farrl- 
Fab30(scFv) with global indicated resolution of 4.0 A. In this map, the 
extracellular portion of M2Rpp showed the lowest relative resolution. 
To improve the densities for the Barrl and M2Rpp interface we used 
the refinement parameters from the global map to subtract more than 
half of the extracellular receptor side together with nanodisc and the 


constant (CL/CH1) domains of Fab30 from raw particle images. The 
subtracted particles were imported to cisTEM” for a refinement with 
local search, resulting in a focused map with global indicated resolution 
of 3.6A. Global resolution estimations were obtained with PHENIX from 
two half maps at Fourier shell correlation (FSC) cutoff of 0.143. Local 
resolution determination of the global M2Rpp-Barr1—Fab30(scFv) 
map and the focused Interface M2Rpp-Barrl-Fab30(scFv) map was 
calculated with blocres from the Bsoft package”® at an FSC cutoff of 
0.5 (Extended Data Figs. 3b, c, Extended Data Table 1). 


Model building and refinement 

The initial model was prepared by manually docking the cryo-EM 
structure of active-state M2R (PDB: 6OIK) and the crystal structure of 
activated V2Rpp-fBarrl-Fab30 (PDB:4JQI) into the cryo-EM map. We 
used the global M2Rpp-Barrl-Fab30(scFv) 4A map for modelling the 
7TM bundleand the focused interface M2Rpp-Barrl-Fab30(scFv) 3.6A 
map for modelling Barrl and its interface with M2R. Iterative rounds of 
real-space refinement were performed with phenix.real_space refinein 
PHENIX” and manual model building with Coot*®. Rosetta was periodi- 
cally used to assess whether more optimal models existed. Additional 
refinement near the end of model building was performed with PHE- 
NIX using the state-of-the-art OPLS3e force field with electrostatics. 
A final refinement was performed on the full complex with PHENIX 
using the global M2Rpp-farrl-Fab30(scFv) 4 A map (Extended Data 
Table 1). Independent FSC curves for model-map correlations were 
calculated between the resulting model and the two maps (Extended 
Data Fig. 3a). Model overfitting was evaluated through its refinement 
against the focused interface M2Rpp-fBarrl-Fab30 half maps after 
randomly displacing all atoms by 0.2 A (Extended Data Fig. 3a). 


Modelanalysis 

Electrostatic potential surfaces were calculated with APBS" in Chi- 
mera. Charges were prepared with PDB2PQR® using the PARSE force 
field*®. Tilt angle of arrestin compared with receptor was calculated 
between residues V37 (Barr1), 1317 (Barr1) and R121 (M2) in the case of 
M2Rpp-Barrl, and V2042 (arr), 12324 (arr), and R135 (rhodopsin) for 
the Rho-Arr1 structure. 


Molecular dynamics simulations 

Starting from a model of the M2R-Barrl complex derived from an earlier 
refinement (model available upon request), the OPM webserver“ was 
used to orient the structure with respect to the plane of the lipid bilayer. 
The aligned structure was then prepared further using CHARMM-GUI* 
to place the structure in either a membrane bilayer or a MSP1D1-44 
nanodisc. In both cases a 3:2 ratio of POPC to POPG was used for the lipid 
system. The systems were hydrated with TIP3P water and charge-neu- 
tralized with 150 mM NaCl. Further system preparation was performed 
in VMD**, in which palmitoylation was added to residue Cys457 of the 
receptor. Serines 488, 493, 494, 495 and threonines 490 and 491 were 
phosphorylated with dibasic phosphate. The systems were simulated 
in NAMD* with the OPLS-AA/M force field*®. OPLS-AA parameters for 
iperoxo and LY2119620 were obtained with the LigParGen server”. 
OPLS-AA parameters for POPC were taken from ref. °°, while POPG 
parameters were adapted from POPC and the OPLS-AA small-molecule 
set. OPLS-AA/M parameters for phosphorylated serine and threonine 
were also developed for this work on the basis of existing OPLS-AA 
parameters for phosphates™. Parameters for palmitoylated cysteine 
were developed for this work by combining OPLS-AA/M parameters 
for cysteine with the lipid parameters of ref. °. All simulations followed 
a 2-fs timestep in the NPT ensemble using a Langevin thermostat set 
at 300 K with a dampening coefficient of 1 ps and a Nosé-Hoover 
Langevin piston barostat set at 1 atm with a period of 50 fs anda decay 
of 25 fs. Non-bonded interactions were smoothed starting at 10 Ato 
12 A, where long-range interactions were treated with particle mesh 
Ewald. Systems were minimized for 2,000 steps before being slowly 


heated from 0 K to 300K in 20-K increments with 0.4 ns of simulation 
at each increment. During the heating phase, harmonic restraints of 
1kcal mol A? were applied to all lipid, protein, and small-molecule 
non-hydrogen atoms. The system was then equilibrated with harmonic 
restraints of 1kcal mol! A~ applied to all non-hydrogen protein atoms 
for 2ns, followed by 2 ns of simulation with restraints on only protein 
Caatoms. These restraints were then gradually reduced first to 0.3 kcal 
mol'A7for 2 ns before being removed completely. This aggregate 12 
ns of heating and equilibration and an additional 18 ns of production 
simulation were discarded from the final calculated quantities. Each 
system was then simulated for five replicates of up to 200 ns to produce 
the results used inthis work. Trajectories were analysed with the VMD 
software*®. Interdomain rotation was calculated as described in ref. ”" 
using the inactive and active state crystal structures of arrestin (PDB: 
1CF1° for the inactive structure and PDB: 4ZWJ”) to define the axis of 
rotation. These structures were chosen as references, as the Rho-Arrl 
structure was thought to be the closest analogue to our structure; 
however, results were qualitatively similar to those of other references. 


Confocal microscopy 

HEK293 (ATCC) cells lacking endogenous Barr1/2 as described else- 
where‘ were transiently transfected witha 1:4 DNA ratio of pcDNA- 
teto-M,R and GFP-Barrl(WT) or the 3xD variant, respectively. Cells 
were not authenticated or routinely tested for mycoplasma. Cells 
were split into 35-mm glass bottom microwell dishes (MatTek) and 24 
h thereafter serum-starved for 2 hin MEM media containing 20 mM 
HEPES (pH7.4) and 1 mg mI BSA. Cells were incubated with Alexa- 
650-labelled Flag-M1 antibody and NucBlue Live cell stain (Invitrogen) 
for 1h at 37 °C, washed twice in starvation media, and imaged using 
confocal microscopy. In parallel, transfected cells split into 6-well plates 
were stimulated for 30 min at 37 °C in the presence or absence of 1 
UM iperoxo. Cells were immediately placed on ice and kept at 4 °C for 
the remainder of the experiment. Cells were washed twice with cold 
phosphate-buffered saline and then detached with 0.05% EDTA. Cells 
were resuspended in assay buffer (Hanks balanced salt solution, 20 
mM HEPES pH 7.4,3 mM CaCl, 1 mg mI BSA) and stained with Alexa- 
650-labelled Flag-MI1 antibody for 30 min at 4 °C. Cells were washed 
once with assay buffer before analysis by flow cytometry (Bio-Rad 
$3e Cell Sorter). Data were analysed using FlowJo software, gating for 
GFP-positive singlet cells. 


G-protein GTPase assay 

The GTPase activity of purified heterotrimeric G,;was measuredin vitro 
using the GTPase Glo Assay (Promega) with the following modifications. 
The final reaction consisted of 20 mM HEPES (pH 7.4), 100 mM NaCl, 10 
mMMgCl,,and1mg mI BSA. HDL-M2Rpp (12.5 nM) was pre-incubated 
with iperoxo (10 pM), purified wild-type Barrl-MC-393 or Barrl-MC- 
393(3xD) (1 1M), and Fab30 (1 uM) for 15 min at room temperature. G-pro- 
tein (250 nM) and GTP (2.5 uM) were subsequently added, and reactions 
proceeded for 1h at room temperature before the addition of GTPase 
Gloreagent and ADP, as described inthe manufacturer’s protocol. Lumi- 
nescence was measured ona CLARIOstar plate reader (BMG Labtech). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| Characterization of the M2R-farrl complex. 

a, Schematic showing sortase-mediated ligation of GGG-V2Rpp onto GPCRs 
containing a C-terminal sortase consensus sequence (LPETGGH). 

b, Competition radioligand binding experiments using H]NMS to measure 
the affinity of iperoxo for HDL-M2Rpp inthe absence (control; Ctl) (logIC., 
-6.98 + 0.07) or presence of Barr (loglCs_ -8.38 + 0.07), Barrl-minimal cysteine 
(MC) (logIC;9 -8.52 + 0.07) and the mutant Barrl(V70C) (logIC;) -8.34 + 0.06). 
c, Co-immunoprecipitation (IP) of Barrland Fab30 in the presence and absence 
of DDM-Flag—-M2Rpp and DDM-Flag-M2R. Dataare representative of three 
independent experiments. d, Statistical analysis of bimane fluorescence data 
from Fig. 1c. Data are the mean of three independent experiments witherror 
bars representing s.e.m.* indicates significance for the indicated comparison, 
determined by one-way ANOVA. e, Competition radioligand binding 
experiments using [>H]NMS to measure the affinity of the agonist carbachol for 


HDL-M2Rpp in the absence (Ctl; logIC;, —5.38 + 0.12) and presence of LY2119620 
(LY211) (logIC,)-6.7 + 0.10), Barrl and Fab30 (logIC,-6.8 + 0.07) or in 
combination (logIC., -7.95 + 0.06). f, Size-exclusion chromatography of the 
final MSP1D1E3-M2Rpp-farrl-Fab30 complex and SDS-PAGE analysis of peak 
fractions. g,h, Lowresolution cryo-EM analysis of M2Rpp-Barrl-Nb24-scFv30 
complex in MSP1D1HS nanodiscs showing Barr ina ‘hanging’ conformation (g) 
or ‘core’ conformations with ‘rocking’ relative to the nanodisc density (h). 

i, Low-resolution cryo-EM map of the M2Rpp-farrl-Fab30 complex inthe 
larger MSP1D1E3 nanodiscs shows farrlin the ‘core’ conformation involving an 
additionalinteraction of the C domain with the lipid bilayer. All Barrl variants 
are truncated at aminoacid 393. Radioligand binding experiments are the 
means of three independent experiments with error bars representing s.e.m.* 
indicates significance compared to control (P< 0.0001, one-way ANOVA). 
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Extended Data Fig. 2|M2Rpp-Barrl complex conformers and cryo-EM 
reconstruction workflow. a, Conformational variability ofthe M2Rpp-Barrl 
complex. Overlay of two low resolution reconstructions aligned onthe 7TM 
portion reveal variability in the angle of the Barrl-Fab30 segment relative to 
the receptor. One map is shown as mesh and the other as solid surface. The 
constant domains of Fab30 have been masked out inthe reconstructions. For 


145,618 particles 


3D refinement 


nanodisc and Fab30 
constant region subtraction 


clarity, only one receptor-nanodisc density is shown. b, Flow chart of cryo-EM 
data processing towards high-resolution reconstructions. All eight-particle 
classes show engagement of Barrl with the lipid nanodisc, but only one class 
(17.4% of particles) could be further processed to high resolution. A final 
focused refinement yielded a3.6 Astructure providing insights into the 
binding interface and orientation of Barrl relative to M2Rpp. 
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Extended Data Fig. 3| Mapresolution and model validation. a, ‘Gold 
standard’ FSC plots of the reconstruction of M2Rpp-fBarrl-Fab30(scFv) (global 
map, black) and focused refinement reconstruction for interface M2Rpp- 
Barrl-Fab30(scFv) (focused map, blue). The red and brown curves represent 
the correlation between model and map for the global map and the focused 
map, respectively. Refined model validation was also performed by calculating 
the correlation between model and focused half map 1 (green), and the 
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correlation between the randomly displaced model and focused half map 2 
(purple). b,c, Local resolution estimation map for M2R-Barrl-Fab30(scFv) and 
focused map of half-M2Rpp-fBarrl-Fab30(scFv).d, Overall model fit to the 
cryo-EM map. The7TM portion (orange) comes from the full M2Rpp-farrl- 
Fab30(scFv) map contoured at o=7.5, whereas the Barrl-scFv portion comes 
from the focused refinement of Interface M2Rpp-farrl-Fab30(scFv) 
contoured ato=5.2. 
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Extended Data Fig. 4 | Structural comparisons of arrestins. a, The tilt of 
B-arrestin relative to the M2Rpp was measured by the angle (red arrow) 
between V37 (Barrl), 1317 (Barrl) and R1213°° (M2R). b, Superposition of Barrl in 
the M2Rpp-farrl complex and visual arrestin in the crystal structure of Rho- 


C-edge 


) middle loop \_ loops 


_—_— >) 
Tey 
~ C-domain 


Arrl(PDB: 5WOP).c, Superposition of Barrl in the inactive state (grey, PDB: 
1G4M), inthe phosphopeptide-activated state (green, PDB:4JQI) and inthe 
receptor-bound active state (gold, M2Rpp-Barrl). The arrows indicate 
rearrangements of central crest loops. 
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Extended Data Fig. 5 | See next page for caption. 


Distance (A) 


Distance (A) 


OFDM ORD CM OCLWMOHLW CLM OLW OLA CLD OLO 


We 


R121°*° CZ - D69 CG 


Oo 
es 
Oo 


80 120 160 200 
Time (ns) 


N58? CG - D69 CG 


Oo 
By 
Oo 
—_ 
N 
Oo 
= 
[22] 
o 
nN 
i=) 
Oo 


Extended Data Fig. 5 | Computational and experimental examination of 
interactions between the finger loop and M2Rpp. a, Depiction of E61 of the 
finger loop and probable interaction partners R57 and R71 (top) and the R60 of 
the finger loop and D130 of the middle loop (bottom) from PHENIX/OPLS3 
refinement in the cryo-EM map. The mesh depicts the 3.6 A cryo-EM map 
contoured ato=4.0 witha masked 3.0 A zonearoundthe atoms depicted (top) 
orcontoured at o=3.0 witha masked 3.0 A zone around the atoms depicted 
(bottom). b, An overlay of the last frame from 200 ns of simulation for five of 
the molecular dynamics simulation trajectories. D69 of arrestin is depicted as 
bonds in addition to R**° and N2”’ of M2Rpp. Lines connect D69 and R* from 
the same snapshot. c, Plots of the R**° zeta carbon-D69 gamma carbon 
distance (top) and N2*’ gamma carbon-D69 gamma carbon distance (bottom) 


over the course of the molecular dynamics simulations performed in this work. 
Grey lines correspond to raw data, whereas coloured lines correspond toal-ns 
sliding average. Teal traces correspond to simulations with membrane, and 
blue traces correspond to simulations with a small nanodisc. d, Competition 
radioligand binding experiments using [>H]NMS to measure the affinity of the 
agonist iperoxo for HDL-M2Rpp inthe absence and presence of wild-type Barrl, 
Barrl(D69A) and the Barrl(A62-77) mutant that lacks the finger loop. Data are 
the mean of four independent experiments with error bars representing s.e.m. 
Inset, difference in logIC;, between Barrl variants and control (no Barrl). 
*denotes significance compared to wild-type Barrl (P< 0.0001, one-way 
ANOVA). 
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Extended Data Fig. 6| Finger loop rearrangements and ICL2. a, Detail of untwisting. b, Expanded view of L129 (orange) inthe cleft of Barrl overlaid with 
crystal structure of activated Barrlwith phosphopeptide (left, PDB: 4JQI) a modelled phenylalanine (green) and methionine (red) at the same position. 
showing the N-terminal portion of V2Rpp bound on B-strands5 and 6, thereby c, Plot of the frequency of specific amino acids occurring in the second position 
twisting the finger-loop fold. Unbinding of the N-terminal portion of V2Rppis of ICL2 for Barr-binding class AGPCRs. The size of the one-letter code is 
required for the finger loop to adopt the observed conformationinthecryo-EM correlated tothe frequency with which that residue occurs at that position. 
structure of M2Rpp-farrl (right). The arrow shows the direction of finger-loop 
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Extended Data Fig. 7 | Conservation of lipid-binding regions inthe arrestin alignment ofthe arrestin family, showing differential conservation of the 
family. a, Ribbon model of the M2Rpp-Barrlcomplex depictingthelocationof | C-edgeloopsand the PtdIns(4,5)P,-binding motif. 
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Extended Data Fig. 8 | Effects of the lipid membrane on Barrl couplingto 
M2Rpp.a, Atomistic simulations of the M,Rpp-Barrl complex ina membrane 
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inactive-like interdomain twist angle, respectively. b, Statistical analysis of the 
data from Fig. 4c; data represent means of three independent experiments 
witherror bars representing s.e.m.*P< 0.0001, one-way ANOVA; indicating 
significance from control within the same subset. c, Statistical analysis of data 
from Fig.4d; data represent the meanand s.e.m. of three independent 
experiments. *P< 0.0001, one-way ANOVA; denotes significance compared to 
control. 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | Barrl C-edge-lipid interaction facilitates M2R 
internalization. a, Flow cytometry analysis of Barr1/2-null cells transiently 
transfected with Flag-M2R and GFP-Barr(WT) or GFP-Barrl(3xD). Cells were 
treated with vehicle or iperoxo for 30 min and subsequently stained with Alexa 
Fluor-650-labelled anti-Flag M1 antibody. GFP* singlet cells were gated for 
analysis. Results are representative of data from three independent 
experiments. b, Quantitation of Flag—-M2R surface staining by flowcytometry, 
as described above. Alexa Fluor-650-labelled anti-Flag M1 staining was 
normalized to the mean fluorescence of unstimulated cells expressing wild- 
type B-arrestinl-GFP in each experiment. These data were used to calculate the 
percentage of receptor internalized in Fig. 5c. Datarepresent the mean and 
standard error from three independent experiments and asterisks indicate 
statistical significance (one-way ANOVA).n.s., not significant. c, Expression of 
GFP-Barrl(WT) or GFP-Barrl(3xD) and Flag—M2R in Barrl1/2-null HEK293 cells 
as assessed by SDS-PAGE and western blot analysis. Tubulin was used asa 
loading control. Data are representative of three independent experiments. 


d, Quantification of GFP-Barrl(WT) or GFP-Barrl(3xD) expression by flow 
cytometry using Barrl/2-null HEK293 cells. Data represent the mean and 
standard error from three independent experiments; *P= 0.0034 (two-sided 
unpaired t-test). e, Localization of GFP-Barrl(WT) or Barrl(3xD) in Flag— 
vasopressin-2-receptor overexpressing HEK293 cells treated with arginine 
vasopressin peptide (AVP) for the indicated time. Data are representative of 
three independent experiments. f, Three-site interaction network of GPCR-B- 
arrestin binding. Inthe classic two-site interaction model, conformational 
changes in B-arrestin induced by binding to phosphorylated receptor (1) leads 
to transmembrane receptor core coupling (2) to sterically block G protein 
binding. Our findings suggest an expanded model including interaction of the 
C domain of B-arrestin with the lipid bilayer (3) because it synergistically 
enhances the interaction of B-arrestin with the phosphorylated receptor tail/ 
loops and transmembrane core. Vertical arrows inthe receptor represent 
directionand strength of cooperativity between the extracellular orthosteric 
ligand-binding and intracellular transducer-binding sites. 


Extended Data Table 1| Cryo-EM data collection, refinement, and validation statistics 


M2R-Barr1- InterfaceM2R- 
Fab3 0(scFv) Barr1- 
(EMDB-20612) Fab30(scFv) 
(PDB 6U1N) (EMDB- 
20948) 
Data collection and processing 
Magnification 47,169 47,169 
Voltage (kV) 300 300 
Electron exposure (e-/A’) 50 50 
Defocus range (um) 1.2-2.2 1.2-2.2 
Pixel size (A) 1.06 1.06 
Symmetry imposed Cl Cl 
Initial particle images (no.) 11.7million 11.7million 
Final particle images (no.) 145,618 145,618 
Map resolution (A) 4A 3.6A 
FSC threshold 0.143 0.143 
Map resolution range (A) 3.6-6 3.2-5 
Refinement 
Initial model used (PDB code) 601K 
A4JQI 
Model resolution (A) 4.03 
FSC threshold 0.5 
Map sharpening B factor (A’) 150 
Model composition 
Non-hydrogen atoms 5863 
Protein atoms 5834 
Ligands 29 
B factors (A’*) 
Protein 164.85 
Ligand 292.74 
R.m.s. deviations 
Bond lengths (A) 0.005 
Bond angles (°) 0.682 
Validation 
MolProbity score 2.2 
Clashscore 18.0 
Poor rotamers (%) 1.0 
Ramachandran plot 
Favored (%) 91.6 
Allowed (%) 8.4 


Disallowed (%) 0.0 
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AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Xx] 
OOO oO 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection SerialEM 3.6 software was used for automated cryoEM data collection. 


Data analysis Data from radioligand binding, bimane fluorescence, and GTPase Glo assay were analyzed used GraphPad Prism v7. Flow cytometry 
data were analyzed using Prosort v.1.5.0.15 and FlowJo 10. CryoEM image processing and 3D reconstructions conducted using 
MotionCor2 1.3.0, Gctf 1.06, relion3.0 and cisTEM 1.0.0. Bsoft 2.0.5 was used for local resolution determination and Phenix 1.15 for map 
fsc curve calculation and model validation. Coot 0.8.9.1 was used for model building. Molecular dynamics experiments were run with 
NAMD v2.13 and analyzed with VMD v1.9.3. Chimera v1.12 was used for electrostatic analysis. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


Atomic coordinates have been deposited in the Protein Data Bank (PDB) under accession number 6U11N and the cryoEM density maps in EMDB under number 
20612 and 20948 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


Life sciences |] Behavioural & social sciences [| Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Since no inferences were determined for a population based on a sample, no sample size determination was needed in this study, 
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Data exclusions No data were excluded from analysis 


Replication A minimum of three independent experiments were conducted for radioligand binding, flow cytometry, bimane fluorescence, confocal 
microscopy, and GTPase Glo assays. All attempts at replication were successful with no data exclusions. 


Randomization |The independent variables of the experimental design were well controlled and did not require randomization 


Blinding Experimental results were all quantitative and did not require any subjective analysis, thus no experiments were preformed with blinding 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
| Antibodies x] | ChIP-seq 
| Eukaryotic cell lines | xX Flow cytometry 
| Palaeontology XI} [| Mri-based neuroimaging 


[_] Animals and other organisms 


| Human research participants 


| Clinical data 


Antibodies 


Antibodies used Anti-FLAG M1 used for receptor purification, flow cytometry (120nM), and confocal microscopy (30nM) was purified from ATCC 
hybridoma line HB-9259 (Clone 4E11). Anti-FLAG-M2-HRP (Sigma; A8592) and Tubulin (Sigma; T7816) were used in western 
blots at concentrations of 1:1000 and 1:5000, respectively. 


Validation Certificate of analysis of FLAG-M1 is available from ATCC website and was further validated by pull-down assays and flow 
cytometry. Specificity of FLAG-M2-HRP was determined by including control cell lysate not expressing FLAG-tagged receptor. 
Tubulin antibody was validated by western blot analysis and compared to that illustrated on the Sigma specification sheet 


Eukaryotic cell lines 


Policy information about cell lines 
Cell line source(s) Expi293F cells: ThermoFisher. The deriative tetracycline-inducible Expi293F cell line used for M2R expression was reported 
in Staus et. al. Proc Natl. Acad. Sci. USA 115, 3834-3839 (2018). Beta-arrestin1/2 knockout HEK293 cell lines (Dr. Asuka 
Inoue, Tohoku University) were characterized and reported in Grundmann et. al. Nat Commun. 2018;9(1):341. 
Authentication No additional authentication of cell lines was performed 


Mycoplasma contamination Cells were not tested for mycoplasma contamination 


Commonly misidentified lines No commonly misidentified cell lines were used 
(See ICLAC register) 


Flow Cytometry 
Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a ‘group’ is an analysis of identical markers). 


| ]All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation beta-Arrestin1/2 null HEK293 cells (courtesy of A. Inoue) overexpressing GFP-beta-arrestin1 constructs were stained with 
AlexaFluor650-M11 (anti-FLAG) antibody. 
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Instrument Bio-Rad S3e Cell Sorter 
Software Data were collected using ProSort software and analyzed in FlowJo 10. 
Cell population abundance __ N/A, cells were not sorted. 


Gating strategy N/A 


[| Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Arrestin proteins bind to active, phosphorylated G-protein-coupled receptors 
(GPCRs), thereby preventing G-protein coupling, triggering receptor internalization 
and affecting various downstream signalling pathways’”. Although there is a wealth of 
structural information detailing the interactions between GPCRs and G proteins, less 


is known about howarrestins engage GPCRs. Here we report acryo-electron 
microscopy structure of full-length human neurotensin receptor 1(NTSRI1) in 
complex with truncated human B-arrestin 1 (Barrl(ACT)). We find that 
phosphorylation of NTSR1is critical for the formation of a stable complex with 
Barrl(ACT), and identify phosphorylated sites in both the third intracellular loop and 
the C terminus that may promote this interaction. In addition, we observe a 
phosphatidylinositol-4,5-bisphosphate molecule forming a bridge between the 
membrane side of NTSR1 transmembrane segments 1 and 4 and the C-lobe of arrestin. 
Compared witha structure of a rhodopsin-arrestin-1 complex, in our structure 
arrestinis rotated by approximately 85° relative to the receptor. These findings 
highlight both conserved aspects and plasticity among arrestin-receptor 


interactions. 


Upon activation, GPCRs signal through G-protein pathways and arres- 
tin pathways to regulate downstream cellular events. Recent studies 
suggest that drugs that direct signalling through only one of these 
pathways (knownas biased drugs) may have fewer side effects® >. There 
are four isoforms of arrestin, of which arrestin-1 (Arr) and arrestin-4 
(Arr4) are known as visual arrestins. Arrl is expressed in retinal rods 
and is important in turning off activated rhodopsin, whereas Arr4 is 
expressed in cones and deactivates colour opsins. The non-visual arres- 
tins—arrestin-2 and arrestin-3, also known as f-arrestin 1 (Barr1) and 
B-arrestin 2 (Barr2)—are important in suppressing G-protein signalling 
for most other GPCRs. In addition to their role in desensitization, non- 
visual arrestins regulate GPCR endocytosis through interactions with 
the clathrin-dependent endocytic machinery, and can act as scaffolds 
for various other cytosolic signalling molecules downstream of GPCR 
activation, such as Src family tyrosine kinases and MAP kinases!?°’”. 
Owing to recent breakthroughs in structural biology, structures of 
family AGPCR-G-protein complexes have been resolved for the three 
major classes of G proteins (G,, G,, and G,)*°; however, the only avail- 
able structure of aGPCR-arrestin complex is that of rhodopsin-Arr1". 
Despite the structural similarities between rhodopsin and other family 
A GPCRs, there are fundamental differences in their activation kinet- 
ics, phosphorylation and arrestin coupling”. To better understand 
the molecular details of the arrestin pathway, and how arrestins can 
interact with a diverse array of GPCRs, we initiated efforts to obtain 
structures of non-rhodopsin GPCR-f-arrestin complexes. 


Barrl was first identified as a regulator of B, adrenergic receptor 
(B,AR) signalling’. Although the B,AR is one of the most extensively 
studied model systems for family A GPCRs, initial efforts to forma 
stable B.AR-Barrl complex for structural studies were not success- 
ful. To identify a more stable GPCR-B-arrestin complex, we devised a 
fluorescence-based screen in which various candidate family A recep- 
tors were expressed in HEK293 cells and evaluated for their ability to 
couple to activated Barrlin a manner independent of the phosphoryla- 
tion status of the receptor (Extended Data Fig. 1). This screen revealed 
that the neurotensin receptor 1 (NTSR1) complexed with Barrlin vitro, 
and showed promise as a candidate for structural studies. 

NTSRI1 mediates responses to neurotensin (NTS) and neuromedin 
N, both of which are derived from the same precursor peptide”. NTS 
regulates a broad spectrum of physiological processes® including 
blood pressure, ileum contraction or relaxation, analgesia and hypo- 
thermia. Although there are three subtypes of neurotensin receptor, 
most of the physiological responses of NTS are mediated by NTSRI1. 
NTSR1is a promiscuous GPCR, which couples to G,, Gyn, Gand Gyp3°°. 
Both active- and inactive-state structures of NTSR1 have been deter- 
mined by crystallography”, and recently structures of the NTSR1-G, 
complex have been determined by cryo-electron microscopy (cryo- 
EM)’. Notably, the cryo-EM analysis revealed two distinct conforma- 
tions of the NTSR1-G, complex, termed the canonical state (C-state) 
and the non-canonical state (NC-state). The C-state is similar to other 
GPCR-G-protein complexes, whereas the NC-state may represent 
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NTSR1 b 


NTSR1 

C-tail 
Fig. 1| The NTSR1-Barr1(ACT) complex features a distinct arrestin 
orientation and unique interaction sites. a,b, Cryo-EM map (a) and model (b) 
of phosphorylated NTSR1 (yellow) bound to the peptide agonist NTS, _,; (green) 
in complex with Barrl(ACT) (blue). Interactions between the receptor and 
Barrl(ACT) are mediated by the phosphorylated Cterminus of NTSR1, the 
C-terminal part of the phosphorylated ICL3, the receptor coreand a PtdIns(4,5) 
P, molecule (pink). The C-edge of Barrl(ACT) is in contact with the detergent 


an intermediate along the activation pathway’. Recent studies have 
shown that an arrestin-biased positive allosteric modulator for NTSR1 
suppressed self-administration of methamphetamine in rats’, provid- 
ing further incentive to elucidate the structure of the NTSR1-farrl 
complex. 


Preparation of the NTSR1-fBarrl complex 


Given the importance of receptor phosphorylation for arrestin cou- 
pling, we chose to use full-length, native NTSR1 bound to the agonist 
NTS, _,;, (amino acids 8-13 of NTS), and phosphorylated the receptor 
in vitro using GPCR kinase subtype 5 (GRKS) following a protocol estab- 
lished forthe B, adrenergic receptor”. GPCR kinases (GRKs) facilitate 
arrestin-mediated desensitization and signalling by phosphorylat- 
ing the receptor on its intracellular loops and its C terminus, which 
increases the ability of the receptor to bind and activate arrestins”. 
Previous analysis of GRK-mediated phosphorylation of NTSR1 indicated 
that whereas both GRK2 and GRKS phosphorylate the C terminus, GRKS 
also phosphorylates the third intracellular loop (ICL3)™*. The extent 
of phosphorylation was monitored by analytical ion-exchange chro- 
matography (Extended Data Figs. 2a, b, 3a). The unphosphorylated 
receptor did not complex to full-length Barrl, whereas GRK5-phospho- 
rylated NTSR1 did (Extended Data Fig. 2c, d). Moreover, pre-activated 
Barrl—truncated at residue 382 (Barrl(ACT))”°—enhanced this coupling 
(Extended Data Fig. 2c, d). We therefore chose to use this construct for 
subsequent experiments. 

Negative-stain electron microscopy images of the purified complex” 
showed particles in which arrestin was tightly engaged with the core 
of the receptor; however, much of the complex dissociated during 
vitrification. To address this challenge, we treated the purified complex 
witha heterobifunctional crosslinker, sulfosuccinimidyl 6-(4,4’-azipen- 
tanamido)hexanoate (sulfo-LC-SDA), which has an amine-reactive 
group on one end and a diazirine moiety on the other. After labelling 
accessible lysine residues, the complex was exposed to ultraviolet 
light to activate the diazirine and trigger crosslinking, followed by 
purification througha second round of size-exclusion chromatography 
(Extended Data Fig. 3a). The sample seemed qualitatively unchanged 
according to negative-stain electron microscopy images (Extended 
Data Fig. 3b) and, although we observe a relatively small fraction of 
crosslinked complex by SDS-PAGE, the photo-crosslinked sample 
remained intact during sample vitrification and afforded homogene- 
ously dispersed particles that were suitable for cryo-EM. Mass spec- 
trometry was used to analyse the effect of the photo-crosslinker onthe 
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NTSR1-Barr1 (ACT) 
—Arrt 


Cytoplasmic 
view TMS 


eo, TM6 


Barr1 (ACT) 
C-lobe 


micelle (translucent grey). c, Overlay of the NTSR1-Barrl(ACT) structure with 
the rhodopsin-arrestin-1 structure (withrhodopsin and Arrl coloured pink and 
green, respectively), viewed from the cytoplasmic side. The structures were 
aligned on the basis of the receptor chains (helices shown as cylinders). 
Compared with Barrl(ACT), Arrlis rotated by approximately 85°. Additional 
views of this overlay are shown in Extended Data Fig. 7a. 


NTSR1-Barrl(ACT) complex. We found that, although several lysine 
residues on the N-lobe of Barr1(ACT) as well as in the ICL1 and ICL2 
regions of NTSR1 reacted with the crosslinker, they largely resulted in 
dead-end crosslinks during photo-illumination (Extended Data Fig. 4, 
Supplementary Tables 1-7). The majority of the observed crosslinks 
were intramolecular and largely confined toICL2 of the NTSR1 and the 
loops of the N-lobe of Barr1(ACT). Only a single intermolecular crosslink 
that was consistent with our structural model was identified—between 
the cytoplasmic end of receptor transmembrane helix 1 (TM1) and a 
B-strand of Barr1(ACT) (Extended Data Fig. 4c)—although its density is 
not observed in our map. Itis therefore possible that the main effect of 
crosslinking was to stabilize Barr1(ACT) in anative conformation with 
enhanced affinity for NTSR1, thereby enabling the complex to survive 
during cryo-EM grid preparation. 


Cryo-EM of the NTSR1-Barr1(ACT) complex 


A large dataset of 18,000 micrographs of the photo-crosslinked NTSR1- 
Barr1(ACT) complex was processed in subsets and combined to yield 
about 600,000 particles from well-defined three-dimensional (3D) 
classes, which were reconstructed to an overall 4.5 A map. Further 
3D classification using a smaller angular step enabled us to identify a 
subset of about 260,000 particle projections that were used to obtain 
amap witha global indicated resolution of 4.2 A (Extended Data Figs.5, 
6a-d, Extended Data Table 1). Conformational heterogeneity in this 
subset was probed using multi-body refinement”, which showed that 
approximately 40% of the variance in the rotations and translations 
between NTSR1 and Barr1(ACT) is accounted for by two eigenvectors 
(Extended Data Fig. 6e-h). However, local resolution could not be fur- 
ther improved. 


Structure of the NTSRI1-Barr1(ACT) complex 


The 4.2 A map (Fig. 1a) was sufficient to build a model of the complex 
(Fig. 1b), with the exception of the Nterminus of the receptor (residues 
1-49), parts of ICL3 (residues 273-285) and part of the C terminus of the 
receptor. Thetransmembrane helices of NTSR1 have well-defined den- 
sity, and most regions of Barrl(ACT) were well-resolved, with the excep- 
tion of the flexible end of the truncated Cterminus (residues 352-382) 
and parts of the C-edge loops (residues 332-340). The receptor engages 
Barr1(ACT) through a portion of its C terminus, the transmembrane 
core and the C-terminal end of ICL3. The C terminus of NTSR1 binds 
the N-lobe groove of Barrl(ACT), which contains basic residues that 
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Intracellular — NTSR1-farri (ACT) 


Fig. 2| Barr1(ACT)-bound NTSR1 adopts a conformation similar to that of G,- 
bound states. a, b, Side view (a) and cytoplasmic view (b) showing the overlay 
of Barrl(ACT)-bound NTSRI1 (yellow) with inactive NTSR1 (grey), and the 
canonical G,-bound NTSR1structures (cG,, teal). TMS in the NTSR1-BArr1(ACT) 
complex has alarger outward displacement than in either NTSR1-G, complex, 
while TM6 adopts aconformation similar to that in the canonical NTSR1-G, 
complex. Outward motions in TMS and TM6 of NTSRI, from theinactivetothe 
Barrl(ACT)-bound state and from the cG,-bound to the Barrl(ACT)-bound state, 
are indicated by orange and purple arrows, respectively. 


are known to interact with phosphorylated receptor residues”®. The 
receptor core engages the Barrl(ACT) finger loop; this interaction is 
a critical determinant of arrestin coupling. We also observe a strong 
density above the Barr1(ACT) C-lobe, inthe region that was previously 
shown to be an inositol phosphate binding site” *. Given that we added 
dioctyl-phosphatidylinositol-4,5-bisphosphate (diC8-PtdIns(4,5)P,) 
during the phosphorylation step, and its presence was confirmed by 
mass spectroscopy analysis of the intact complex (Supplementary 
Fig. 1), we conclude that this density corresponds to diC8-PtdIns(4,5) 
P,. The functional role of the binding of PtdInsP, is discussed below. 
We also observe that the C-edge-that is, parts of the 340-loop (resi- 
dues 330-340) and 191-loop (residues 186-198)—seems to be in con- 
tact with the detergent micelle. In Arrl, these loops—named 344-loop 
and 197-loop—have been shown to interact with the membrane upon 
arrestin activation”. Notably, and as previously discussed’, only Arr1, 
Arr4andthe long splice variant of Barrl have the 344- or the 340-loop, 
whereas Barr2 and the short splice variant of Barrl do not possess this 
loop region. No substantial difference in the ability ofthese isoforms to 
bind tothe B,AR and M2 receptors was observed**; however, this loop 
region may be important in regulating the strength of engagement of 
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Central crest loops 


Fig.3| NTSR1-bound Barr1(ACT) shows activation hallmarks, withsome 
loops in distinct conformations. a, Overlay of NTSR1-bound Barrl(ACT) (blue) 
with inactive Barrl (wheat). NTSR1-bound Barrl(ACT) undergoes a16° 
interdomain twist, a feature observed to varying degrees in all active-state 
arrestin structures (Supplementary Fig. 8a), and substantial rearrangements of 
the central crest loops (highlighted in purple and orange, respectively). FL, 
finger loop; GL, gate loop; LL, lariat loop; ML, middle loop.b, The position of 


SY 


arrestin upon activation, and seems to havean important role in stabiliz- 
ing our complex. The interaction between the micelle and the C-edge 
leads to a tilt of Barr1(ACT) relative to the receptor (Fig. 1a, b), which 
may be exaggerated by virtue of the small size of a detergent micelle 
compared witha planar bilayer. The implications of this flexibility and 
the role of membrane curvature are discussed below. 

Notably, Barrl(ACT) engages NTSR1in a relative orientation that is 
rotated by approximately 85° in the membrane plane compared with 
how Arrl engages rhodopsin (Fig. 1c, Extended Data Fig. 7a). As dis- 
cussed below, even though both complexes use the same GPCR/arrestin 
interface, differences in the phosphorylation-mediated interactions, 
the engagement of the receptor core and the binding of PtdIns(4,5)P, 
could explain this substantial difference. We suggest that this highlights 
the exceptional plasticity of arrestin and illustrates the spectrum of 
conceivable receptor-arrestin assemblies. It should be noted that 
it has previously been anticipated that arrestins may adopt distinct, 
receptor-specific orientations”®. 


Conformational changes in NTSR1 


The hallmark feature of the activation of family A GPCRs—that is, the 
outward movement of TM6-—is clearly observed in NTSR1 (Fig. 2).Com- 
pared with the previously published structure of inactive NTSRI1, both 
TM6 and TMS inthe NTSR1-Parrl(ACT) structure move away fromthe 
core by 10.9 Aand5.0A, respectively (Fig. 2b), adoptingan active con- 
formation similar to that seen in the C-state of the G,-coupled NTSR1 
structure (discussed above)’, with a root-mean-square deviation of 0.67 
A. Whereas the displacement of TM6 was similar to that seen in both 
G,-bound states, TM5 is moved further outward, by 2.1 A (by measuring 
the Ca of A270) (Fig. 2b). 


Conformational changes in Barr1(ACT) 

Barrl(ACT) bound to NTSR1 adopts a structure that contains the con- 
formational hallmarks of arrestin activation’; we propose this on the 
basis of a comparison with crystal structures of the inactive Barr1*, 
the active-state Barrl bound to a phosphopeptide derived from the 
vasopressin receptor C-tail anda stabilizing Fab® (referred to as V2Rpp- 
Barr), and the active-state Arrl bound to rhodopsin" (referred to as 
Rho-Arrl). First, we observe an interdomain twist—that is, a rotation 
of the C-lobe relative to the N-lobe (Fig. 3a)—of around 16°, which is 
smaller than the twist of approximately 22° that is observed inthe struc- 
tures of V2Rpp-farrl and Rho-Arrl (Extended Data Fig. 7b). Second, 
the finger, gate and middle loops—which form the central crest and 
are essential in receptor coupling—adopt active-state conformations 
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residue D297 (shown as Ca sphere) in the gate loop of the NTSR1-farrl(ACT) 
complex is too far away from R169 (shownas Ca sphere) to stabilize the polar 
coreinthe inactive state. c, Compared withthe active state Rho-bound Arrl 
(green) and V2Rpp-bound Barrl (orange), the middle loop, gate loop and lariat 
loop of NTSR1-bound Barrl(ACT) adopt similar conformations, while the finger 
loop andthe C-loop (CL) adopt more distinct conformations, probably dueto 
the unique receptor-arrestin orientation that we observe. 
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Fig. 4| Comparison of receptor/arrestin and receptor/G-protein interfaces. 
a, Acomparison of the NTSR1/Barrl(ACT) interactioninterface (left) with the 
Rho/Arrlinteraction interface (right) highlights the conformational plasticity 
of arrestin and the 85° rotation between how Arrl1 and Barrl(ACT) engage 
rhodopsin and NTSRI1, respectively. The same arrestin elements interact with 
distinct regions onthe receptors to form a unique interface. b, Engagement of 
the receptor core by the finger loop of the arrestin (left) and by the Ga, «5-helix 
(right). 


(Fig. 3a). Third, we observe that the polar core is disrupted, as evidenced 
bythe backbone position of residue D297 in the gate loop being moved 
away from R169, abrogating a key contact that stabilizes the inactive 
state (Fig. 3b). Moreover, because Barrl(ACT) is truncated at residue 
382, residue R393 is no longer present to contribute to the formation 
of the polar core. Notably, the backbones of the gate loop and mid- 
dle loop in our structure overlay well with those of the Rho-Arrl and 
V2Rpp-farrl complexes, whereas the finger loop andthe C-loop adopt 
distinct conformations (Fig. 3c). 


NTSRI1/Barr1(ACT) interactioninterface 


Severalinteraction elements stabilize the NTSR1-Barr1l(ACT) complex, 
and they probably explain the unique orientation that we observe com- 
pared with the Rho-Arrl1 complex (Fig. 1c). Although the finger loop, 
C-loop, lariat loop and middle loop of Arrl and Barr1(ACT) are part of 
the interface, they interact with different regions of rhodopsin and 
NTSRI, respectively, resulting in very different orientations of arrestin 
inthe twocomplexes (Fig. 4a). In the NTSR1-Barrl(ACT) complex, the 
finger loop adopts a helical structure—which is also seen for Arrl1?”—and 
itinsertsinto the receptor intracellular cavity, butit is extended farther 
away from the N-lobe of arrestin relative to the finger loop of Arrlin 
the rhodopsin complex" (Figs. 3c, 4a). The finger loop of Barr1(ACT) 
occupies a position similar to that of the a5-helix of Ga,in the NTSR1- 
G, complex (Fig. 4b). In the NTSR1-farrl(ACT) complex, ICL2 forms 
a helix and is located on the outer side of the C-loop, whereas in the 
Rho-Arrl complex ICL2 is sandwiched between the C-loop, middle 
loop and lariat loop (Figs. 4a, 5a). Itis notable that the ICL2 of rhodopsin 
does not form ahelix in the G,-bound structure, whereas it does in the 
Arr1-bound structure; this is in contrast to NTSRI, in which ICL2 
adopts a very similar conformation in its G,-bound and Barrl(ACT)- 
bound states. 
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NTSRI phosphorylation stabilizes the complex 

As noted above, receptor phosphorylation is essential for the forma- 
tion of a stable complex (Extended Data Fig. 2). Phosphoproteomics 
experiments on a pre-formed NTSR1-farrl(ACT) complex identified 
four phosphorylation sites inICL3 and six inthe C terminus (Extended 
Data Fig. 8a, Supplementary Figs. 2-5, Supplementary Tables 8-39), 
with less than full occupancy of each position. We propose that the 
heterogeneity we observe is the result of distributive phosphorylation 
by GRKS in vitro—consistent with literature reports stating that GRKs 
have limited sequence recognition constraints*—and might even be 
important to enable arrestin to adopt arange of conformational states. 

We observed density corresponding to part of the C terminus of 
NTSR1 in the N-lobe of Barrl(ACT) (Fig. 5b), with a distinct bulge inthe 
density adjacent to K294 of Barrl(ACT) consistent with a phosphate; 
however, owing to the resolution we cannot definitively determine the 
register of the C terminus of NTSRI within this density. On the basis 
of the distance between the last amino acid observed for H8 and the 
density observed for the C-terminal peptide, we can probably exclude 
$396, S401, S403 and S404 from being at this position. As such, it is 
possible that our structure represents p1407 making a contact to K294, 
and we have modelled it as such in Fig. 5b. Both the V2Rpp-Barrl and 
the Rho-Arrl complexes have a phosphate inthe same position, inter- 
acting with the gate loop lysine and thereby stabilizing arrestin in the 
active conformation (Extended Data Fig. 8b). 

We also observe strong density forthe C-terminal end of ICL3, which 
turns sharply at the end of TM6 and extends behind TMS tothe base of 
the finger loop (Fig. 5c, d). The density for residue S287 of ICL3, which 
is phosphorylated in our complex, is adjacent to residues R76 and K77 
of Barr1(ACT) at the base of the finger loop. We speculate that phos- 
phorylation of S287, but potentially any phosphorylated residue in the 
inherently flexible ICL3, could serve to disrupt the contact between 
K77 and E313 that maintains arrestin in the inactive conformation*®. 
It is noteworthy that, inthe V2Rpp-Barrl structure, K77 is stabilized 
in the active state by pT347 of the V2Rpp* (Extended Data Fig. 8b). 


PtdIns(4,5)P,, stabilizes NTSR1-Barr1(ACT) 

Inthe NTSR1-farr1(ACT) structure, arrestin is strongly tilted towards 
the membrane, at an angle of about 40° relative to the membrane 
plane (Fig. 1a, b); this same angle is only about 15° for Rho-Arr1. The 
difference in tilt may be attributed to the interaction of the C-edge 
with the detergent micelle, as well as to the presence of PtdIns(4,5)P, 
bridging the membrane surface of TM1 and TM4 with the top of the 
C-lobe of arrestin. Although the observation of a PtdIns(4,5)P, mol- 
ecule bridging NTSR1 and Barr1(ACT) was unexpected, this region of 
Barrland Barr2 was previously shown to bind inositol phosphates”? #1, 
and native mass spectrometry experiments had shown that NTSRI1 
was capable of binding to PtdIns(4,5)P, through contacts between 
positively charged residues on the membrane-facing side of TM1 and 
TM4”. We confirmed the presence of PtdIns(4,5)P, in our complex by 
mass spectrometry, and determined that a fluorescent PtdIns(4,5)P, 
analogue could bind to bothNTSRI (dissociation constant, K,= 0.3 pM) 
and Barr1(ACT) (K,=0.9 uM) (Supplementary Fig. 1). PtdIns(4,5)P, fits 
wellinto the density observed between the detergent micelle and the 
C-lobe of Barr1(ACT) (Fig. 5e, f). Although the precise orientation of 
the phosphatidylinositol head group is ambiguous at the resolution 
of our map, the modelled position places the phosphates at positions 
4 and 5 within range to interact with R236, K250, K324 and K326 of 
Barrl(ACT). The hydroxyl group at position 3 of the PtdIns(4,5)P, head 
group can forma hydrogen bond with R182 on TM4 of the receptor, 
and the bridging phosphate can form a hydrogen bond with Y103 on 
TM2 (Fig. 5f). Using a NanoBiT assay*”, we examinedin cells the effect 
of mutating residues that are involved in the binding of PtdIns(4,5)P,to 
arrestin?*°’, When stimulated by an agonist, the mutant Barrl(K232Q/ 
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Fig. 5| The NTSR1/Barr1(ACT) interface comprises several distinct types of 
interactions. Multiple interaction interfaces (dashed boxes) stabilize the 
NTSR1-farrl(ACT) complex. a-f, Close-up view of distinct interaction 
interfaces, with local density from the electron microscopy map shownas grey 
mesh.a, The C-loop (teal) isin close proximity to ICL2, possibly interacting 
through hydrophobic contacts. b, The gate loopis inclose proximity to part of 
the C-tail of NTSR1. A bulge in the map density suggests that a 

phosphorylated residue would be positioned as shown to interact with K294, 
thus disrupting the polar core. c, The finger loop (teal) inserts into the receptor 
cavity formed by outward motion of TM6. d, The C-terminal part of ICL3 adopts 
a distinct orientation, forming asharp angle with the intracellular end of TM6. 
Acharge-charge interaction between a phosphorylated residue, such as S287, 


R236Q/K250Q)—containing lysine-to-glutamine substitutions at resi- 
dues 232 and 250, and an arginine-to-glutamine substitution at residue 
236—showed an approximately 40% reductionin recruitment of arrestin 
to NTSR1 when compared with the wild-type (Fig. 5g). This is consist- 
ent with the binding of arrestin to PtdIns(4,5)P, being important for 
complex formation with NTSRI1. 


Diversity of GPCR-arrestin interactions 


We present the structure of the NTSR1-Barrl(ACT) complex using a 
native receptor that is phosphorylated in vitro by GRK5. Acomparison 
with the structures of the Rho-Arrl complex and with the M2 mus- 
carinic receptor(M2R)-Barrl complex in lipid nanodiscs—described in 
arelated study*'—highlights the plasticity and diversity of interactions 
that comprise the receptor/arrestin interface. 

We found that the binding of PtdIns(4,5)P, atthe NTSR1/Barrl(ACT) 
interface stabilizes the complex and may impose a stronger arrestin 
tilt relative to the membrane planecompared to that in M2R-Barrland 
Rho-Arr1. This tilt may be exaggerated by the smaller hydrophobic 
interface that a detergent micelle provides compared with a planar 
lipid bilayer. Conformational analysis of the cryo-EM projections 
indicates the least-inclined state captured is tilted to a comparable 
extent to that seen in Rho-Arrl and M2R-farrl1. However, even the 
most-tilted orientation may be meaningful in the context of non-pla- 
nar membrane structures that can feature high degrees of curvature, 
such as endosomes, from which GPCR-arrestin signalling can also 
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and positively charged residues at the base of the finger loop (R76 and K77) may 
explain this unusual loop conformation. e, f, A PtdIns(4,5)P, molecule interacts 
with basic residues of the Barrl(ACT) C-lobe groove and residues onthe 
membrane-facing side of TM1, TM2 and TM4. g, A NanoBiT-based proximity 
assay shows that the triple mutant Barrl(K232Q/R236Q/K250Q), in which 
residues required for the binding of PtdIns(4,5)P, are mutated, is recruited to 
activated NTSRI1toalesser extent (*P=0.02) than wild-type Barrl. Data points 
and error bars represent data from individual experiments ands.d., 
respectively. The maximum response (F,,,x) of NTS-stimulated Barrl 
recruitment to NTSR1is 11.2 + 2.8 for wild-type Barrl and 6.4 +2.4 for 
Barrl(K232Q/R236Q/K250Q). One-way analysis of variance, n=5 biologically 
independent experiments. 


occur” (Extended Data Fig. 9). The observations that interactions with 
membrane phosphoinositides enable arrestins to maintain signalling- 
competent conformations after dissociating from GPCRs (referred to 
as ‘action at a distance’)—and that these dissociated arrestins, as well 
as GPCR-arrestin complexes, can signal from clathrin-coated struc- 
tures**“—further supports the notion that PtdIns(4,5)P, and membrane 
curvature may have a role in arrestin function. As such, once arrestin 
is activated by the receptor core, C-tail and/or ICL3 engagement, the 
binding of specific lipids such as PtdIns(4,5)P,, and the insertion of 
the C-edge into the lipid bilayer may contribute to the maintenance 
of anactive conformation that no longer requires the presence of the 
receptor. It has been shown that NTSRI-G, mediated signalling acti- 
vates phospholipase C, and the production of inositol phosphate and 
diacylglycerol from PtdIns(4,5)P,, as detailed in a previous publica- 
tion’’. Thus, NTSR1 signalling through G, can change the membrane 
content of PtdIns(4,5)P, and possibly influence interactions between 
NTSR1 and Barr. 

Compared with both Rho-Arrl and M2R-farrl, the relative orienta- 
tion between the receptor and arrestin that we observe for NTSR1- 
Barrl(ACT) is unique. This suggests that the same arrestin regions can 
engage receptors in different ways, and that the orientation is modu- 
lated by the nature of the interactions present. For example, the inter- 
action with the phosphorylated ICL3 as well as binding to PtdIns(4,5) 
P,—which are observed onlyinthe NTSR1-farrl(ACT) complex—can be 
satisfied only by adopting a different relative orientation of arrestin. 
Within arrestin we observe different conformations of the finger loop 
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between NTSR1-Barrl(ACT), M2R-Barrl and Rho-Arrl1, suggesting 
that this loop can be ordered in different ways; this may be important 
for the recognition of different receptors. The ability of arrestin to 
assume multiple conformations***’, including different twist angles 
and relative orientations, might be important in enabling binding to 
differently phosphorylated receptors. Taken together, this high degree 
of conformational plasticity and the multiplicity of interface contacts 
might be what enables only two B-arrestin isoforms to interact with 
a large number of GPCRs that share little sequence homology at the 
arrestin/receptor interface. 
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Methods 


Data analysis and figure preparation 

Figures were created using the PyMOL 2.3.2 Molecular Graphics Sys- 
tem (Schrédinger), and the UCSF Chimera X 0.9 package“. Data were 
plotted with GraphPad Prism 8.3.0. Mass spectrometry data were 
analysed with Byonic v.3.4 and Byologic v.3.4-55 (Protein Metrics), 
DataAnalysis v.5.2 (Bruker Daltonics) and Excalibur v.4.1 (Thermo 
Fisher Scientific). 


Screening GPCRs for strong arrestin coupling 

GPCRs with an N-terminal Flag tag and a C-terminal eGFP followed 
by deca-His and Rho1D4 tags were constructed to pCDNA3.1(+) and 
expressed in HEK293T cells (ATCC, confirmed mycoplasma negative 
by manufacturer). Cells were transfected with Lipofectamine 2000 
(Invitrogen) and incubated for 72 h before being collected. Cell pel- 
lets were solubilized in solubilization buffer (1% lauryl maltose neo- 
pentyl glycol (LMNG, Anatrace), 0.1% cholesteryl hemisuccinate tris 
salt (CHS, Steraloids), 500 mM NaCl, 20 mM HEPES 7.4, 25% glycerol, 
protease inhibitors and 1 pM ligand) for 2 h at 4 °C. After centrifuga- 
tion, the supernatant was incubated with Ni-NTA for 30 min at 4 °C. 
The resin was washed in batch with 5 column volumes of wash buffer 
(0.01% LMNG, 0.001% CHS, 20 mM HEPES pH 7.4,500 mM NaCl, 10 mM 
imidazole and 1M ligand). The receptor was eluted in wash buffer 
supplemented with 200 mM imidazole. To test arrestin coupling, rat 
Barrl(1-392) was activated by incubating with V2Rpp and Fab30 before 
the addition of receptor. The V2Rpp peptide: ARGRDTPPpSLGPQDEp- 
SCpTpTApSpSpSLAKDTSS was obtained by custom peptide synthesis 
(Tufts University Core Facility). Fab30 was expressed and purified as 
previously described®. Pre-activated Barrl (5 1M) was then added to 
each receptor and incubated for 30 min at room temperature. Samples 
were then subjected to size-exclusion chromatography (SEC) using a 
Superdex 200 10/300 GL column (GE Healthcare) and monitored by 
the fluorescence of eGFP (excitation wavelength (A,,) 480 nm, emission 
wavelength (A,,,) 512 nm). 


NTSR1 expression and purification 

Full-length wild-type human NTSRI1 was modified with an N-terminal 
Flag tag followed by an octa-histidine tag, and cloned into pFastBacl 
vector. NTSR1 was expressed in Spodoptera frugiperda (Sf9) insect 
cells (Expression Systems) using a FastBac-derived baculovirus. Cells 
were infected at a density of 4 x 10° cells per ml and collected 60h 
post infection. Cells were lysed in hypotonic buffer (10 mM HEPES, pH 
7.4, and protease inhibitors) and solubilized at 4 °C for 2hin a buffer 
containing 1% LMNG, 0.1% CHS, 0.3% sodium cholate (Sigma), 20 mM 
HEPES 7.4, 500 mM NaCl, 25% glycerol, iodoacetamide (to cap cysteine 
residues) and protease inhibitors. Insoluble debris was removed by 
centrifugation and the supernatant was incubated with Ni-NTA (Qia- 
gen) resin for 1h at 4 °C. The resin was washed in batch with buffer 
containing 0.01% LMNG, 0.001% CHS, 0.003% sodium cholate, 20 
mM HEPES pH 7.4, 500 mM NaCl, 10 mM imidazole and eluted with 
the same buffer supplemented with 200 mM imidazole, 2 mM CaCl, 
and 10 uM NTS, ,, (acetate salt, Sigma). The eluate was loaded onto 
M1-Flag immunoaffinity resin and washed with buffer containing 
0.01% LMNG, 0.001% CHS, 0.003% sodium cholate, 20 mM HEPES pH 
7.4,500 mM NaCl, 10 mM imidazole, 0.1 1M NTS,_,, and 2 mM CaCl. 
The receptor was eluted with buffer containing 100 mM NaCl, 20 mM 
HEPES pH 7.4, 0.005% LMNG, 0.005% CHS, 1 1M NTSg_;3, 0.2 mg mI? 
Flag peptide (DYKDDDDK) and 5 mM EDTA. Elution fractions contain- 
ing receptor were pooled and subjected to polishing by size-exclusion 
chromatography on a Superdex 200 Increase 10/300 GL column 
(GE Healthcare) in20 mM HEPES, pH 7.4, 100 mM NaCl, 0.0025% LMNG, 
0.00025% CHS and 0.1 uM NTSg_;3. Peak fractions were pooled and 
concentrated to 200 uM and aliquots were flash-frozen and stored 
at -80 °C until use. 


GRKS expression and purification 

Full-length wild-type human GRKS was modified with a C-terminal 
hexa-histidine tag and cloned into pVL1392 vector for baculovirus pro- 
duction. GRKS5 was expressed and purified as previously described*. In 
brief, Sf9 insect cells (Expression Systems) were infected with a BestBac- 
derived baculovirus at a density of 3.5 x 10° cells per ml and collected 
48 h post-infection. Cells were resuspended, lysed by sonication and 
the supernatant was applied to Ni-NTA resin. The resin was washed 
with lysis buffer and GRKS was eluted with lysis buffer supplemented 
with 200 mM imidazole. The combined eluate was then subjected to 
cation-exchange chromatography using a MonoS 10/100 column (GE 
Healthcare) and eluted with a linear gradient of NaCl. Fractions con- 
taining GRK5 were combined and run on a Superdex 200 10/300 GL 
column (GE Healthcare). GRKS was aliquoted, flash-frozen, and stored 
at -80 °C until use. 


Arrestin expression and purification 

The long splice variant of human, cysteine-free (CS9V/C125S/C140L/ 
C150V/C242V/C251V/C269S) B-arrestin1 was used as full-length arrestin 
for these studies, and modified with an N-terminal hexa-histidine tag, 
followed by a3C protease site, a GG linker, Avitag and GGSGGS linker. 
The sequence was codon- optimized for expression in Escherichia coli 
and cloned into a pET-15b vector. Barr1(ACT) was prepared by using a 
Barrl construct truncated at residue 382. All arrestin constructs used 
were prepared as follows: NiCo21(DE3) competent E. coli (New Eng- 
land Biolabs) were transformed and large-scale cultures were grown 
in TB + ampicillin at 37 °C until an optical density at 600 nm (OD,o9) 
of 1.0 was reached. Cells were then transferred to room temperature 
and induced with 25 uM IPTG when the OD, , reached 2.0. Cells were 
collected 20h post-induction and resuspended in lysis buffer (SO mM 
HEPES pH 7.4,500 mM NaCl, 15% glycerol, 7.13 mM B-mercaptoethanol) 
toafinal volume of 40 ml per litre of cells. Cells werelysed by sonication 
and the clarified lysate applied to nickel sepharose and batch incu- 
bated for 1.5 h at 4 °C. The resin was washed with 10 column volumes 
of wash buffer (20 mM HEPES pH 7.4, 500 mM NaCl, 10% glycerol, 7.13 
mM B£-mercaptoethanol) + 20 mM imidazole, followed by 10 column 
volumes of wash buffer + 40 mM imidazole. The protein was then eluted 
with 5 column volumes of wash buffer + 200 mM imidazole and dialysed 
overnight in 100 volume of dialysis buffer (20 mM HEPES 7.4, 200 
mM NaCl, 2mM B-mercaptoethanol, 10% glycerol) in the presence of 
1:10 (w/w) of 3C protease. The digested protein was then subjected to 
reverse-nickel purification and diluted with dialysis buffer contain- 
ing no NaCl to bring the NaCl concentration to 75 mM. The protein 
was then purified by ion-exchange chromatography (mono Q 10/100 
GL, GE Healthcare), followed by size-exclusion using a Superdex 200 
Increase 10/300 GL column (GE Healthcare) with SEC buffer (20 mM 
HEPES pH 7.4, 300 mM NaCl, 10% glycerol). The purified protein was 
then concentrated to 200-300 uM using a30-kDa spin concentrator 
and aliquots were flash-frozen in liquid nitrogen and stored at -80 °C 
until use. 


NTSRI phosphorylation 

NTSR1 (2.5 uM) was equilibrated in phosphorylation buffer (20 mM 
bis-tris propane (BTP) pH 7.5, 35 mM NaCl,5 mM MgCl, 20 UMNTS,_,;, 
20 uM diC8-PtdIns(4,5)P,, 0.05 mM TCEP, 0.002% MNG, 0.0002% CHS) 
at 25 °C with gentle mixing for 1h. GRK5 was addedto the reaction toa 
final concentration of 200 nM, and briefly incubated while the reaction 
was warmed from 25 °C to 30 °C. ATP was added to afinal concentration 
of 1mM. The reaction was sampled at various time points by removing 
asmall volume, mixing it with EDTA (to 50 mM final) to halt phospho- 
rylation and evaluated by ion-exchange chromatography, which was 
performed as follows: 10 ul of reaction mixture was added to 40 ul of 
buffer A (20 mM BTP pH 8.6, 50 mM NaCl, 0.004% LMNG) andinjected 
onto an equilibrated (100% buffer A) monoQ (5/50) cation exchange 
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column (GE Healthcare). The column was washed with1 column volume 
(CV) of alinear gradient from 0 to 10% buffer B (20 mM BTP pH 8.6,1M 
NaCl, 0.004% LMNG), 6 CV at 10% B, 1CV from 10 to 40% B then 6 CV at 
40% B before being re-equilibrated into buffer A. In some cases, alonger 
linear gradient was used instead of a stepped elution. This involved 1 
CV from 0% B to 5% B, 5 CV 5% B then 20 CV from 5 to 40% B. Protein 
elution was monitored by tracking tryptophan fluorescence at A,, of 
280 nm and J,,, of 340 nm. Upon completion, the reaction was sup- 
plemented with CaCl, toa final concentration of 2 mM and applied to 
an equilibrated M1-Flag immunoaffinity resin and washed with buffer 
containing 0.004% LMNG, 0.004% CHS, 20 mM HEPES pH 7.4,100 mM 
NaCl, 0.2 UMNTS,_,;,2 mM CaCl,. The receptor was eluted with buffer 
containing 100 mM NaCl, 20 mM HEPES pH 7.4, 0.004% LMNG, 0.004% 
CHS, 0.2 uM NTS,.,3, 0.2 mg ml 1x Flag peptide (DYKDDDDK), 5 mM 
EDTA. Elution fractions containing receptor were pooled and used for 
arrestin complexing. 


Analytical fluorescence size-exclusion chromatography for 
construct screening 

Ina final volume of 20 pl, NTSR1 (4.5 iM), the respective arrestin con- 
struct (9 uM) and NTS, ,, peptide (50 pM) were incubated in buffer 
containing 20 mM HEPES pH 7.4, 100 mM NaCl, 0.004% LMNG, 0.0004% 
CHS and 0.2 uM NTS, _,;. Using a Prominence-i LC autosampler (Shi- 
madzu), 10 pl was injected onto aENrich size-exclusion chromatogra- 
phy 65010 x 300 column (Bio-Rad) pre-equilibrated in 20 mM HEPES 
pH 7.4 100 mM NaCl, 0.004% LMNG, 0.004% CHS and 0.2 uM NTS,_43, 
and run at a flow rate of 0.8 ml min”. Tryptophan fluorescence was 
monitored at A, of 280 nm and .,,, of 340 nm. 


Formation and purification of NTSR1-Barrl(ACT) complex 

For cryo-EM, phosphorylated NTSR1 was mixed in an equimolar ratio 
with Barrl(ACT) and diC8-PtdIns(4,5)P, at a concentration of around 
5 uM and supplemented with NTS, ,, to 10 pM final concentration. 
The mixture was incubated at 25 °C for 30 min before being concen- 
trated with a50 kDa molecular weight cut-off concentrator (Amicon 
or Vivaspin) to around 350 pl and purified by size-exclusion chroma- 
tography using two Superdex 200 Increase 10/300 GL columns (GE 
Healthcare) connected in tandem. The mobile phase used was 20 mM 
HEPES pH 7.4, 100 mM NaCl, 0.00075% LMNG, 0.00025% GDN 0.0001% 
CHS and 0.2 uUMNTS,_,. Fractions containing complex were combined 
and diluted to1 1M final concentration, and sulfo-LC-SDA (also known 
as sulfo-NHS-LC-diazirine; sulfosuccinimidyl 6-(4,4’-azipentanamido) 
hexanoate) (Thermo Fisher) as a solution in DMSO was added to 250 pM 
final concentration, and such that the final DMSO concentration was 
below 4%. The reaction was allowed to proceed for 45 min at 25 °C in 
the dark, before hydroxylamine was added toa final concentration of 3 
mM and incubated for an additional 15 min. Thesample was distributed 
into a clear 96-well plate (90 ul per well), put on ice and irradiated for 
45 min using a UVL-56 lamp. The sample was then pooled and concen- 
trated to around 500 ul then re-run on size-exclusion chromatogra- 
phy, again using two Superdex 200 Increase 10/300 GL columns (GE 
Healthcare) connected in tandem. Peak fractions were combined and 
concentrated with a 100 kDa molecular weight cut-off concentrator 
(Amicon) toa final concentration of 4.5 mg mI. 


Cryo-EM sample preparation and image acquisition 

Analiquot of 3.5 pI NTSR1-Barrl(ACT) sample was deposited onto glow- 
discharged 200 mesh grids (Quantifoil R1.2/1.3) and plunge-frozen into 
liquid ethane using an FEI Vitrobot Mark IV (Thermo Fisher Scientific). 
Several sessions of data collection were conducted onthe same Titan 
Krios equipped with an energy filter and operated at 300 keV using a 
nominal magnification of 130,000. Movies were captured using a 
Gatan K2 Summit direct electron detector in counted mode, which 
resulted in a pixel size of 1.06 A. Movie stacks were obtained witha 
defocus range of -1.0 to -2.0 xm, using SerialEM 3.7.10“ with a set of 


customized scripts enabling automated low-dose image acquisition. 
Each movie stack was recorded for atotal of 8s with 0.2s per frame. The 
exposure rate was seven electrons per pixel per second. Initial datasets 
were collected by one exposure per hole per single stage movement 
but subsequent collections used nine-hole exposures per single stage 
movement with beam-tilt compensated by using multi-record strategy 
implemented in SerialEM 3.7.10. 


Cryo-EM data processing 

A total of 18,797 image stacks were subjected to beam-induced motion 
correction using MotionCor2”. Contrast transfer function parameters 
for each micrograph were estimated from the exposure-weighted aver- 
ages of all frames by Gctf v.1.06°. The following processes were all per- 
formed using RELION3.0%, except those mentioned specifically. After 
2D classification, 2,628,700 particles were divided into 6 subsets for 
3D classification. A reference map for 3D classification was generated 
by the ‘3D initial model’ script in RELION 3.0 using default stochastic 
gradient descent parameters. All stable classes were then combined for 
3D refinement, which ledtoa4.5 A map. Further 3D classification of this 
particle set resulted in two best classes with a total of 263,965 particles. 
The resulting Bayesian polished particles were used to build the final 
map withan overall resolution of 4.2 A, as determined by Fourier shell 
correlation (FSC) using a cutoff of 0.143. To probe the conformational 
dynamics, multi-body refinement was performed onthe 4.2 A map 
with two bodies corresponding to NTSR1 and fBarrl. Local resolution 
was estimated with the Bsoft 2.0.6 package” (Extended Data Fig. 6b). 


Model building and refinement 

The initial template for NTSR1-Barr1(ACT) was built from the receptor 
coordinates of the crystal structure of NTSRI-NTS, ,, (RCSB Protein 
Data Bank (PDB): 4GRV) and arrestin coordinates fromthe V2Rpp-Barrl 
structure (PDB: 4JQI). The initial coordinates for the NTS,_,, ligand 
were taken from the NTSRI-NTS, ,, crystal structure (PDB: 4GRV). 
Models were docked into the electron microscopy density map using 
UCSF Chimera 1.14°°, then refined by several iterations of automated 
refinement in PHENIX interspersed with manual adjustments in Coot 
0.8.9°*. The NTSRI C terminus was built on the basis of the structure 
of V2Rpp bound to Barrl (PDB: 4JQI). The final model was subjected 
to global refinement and minimization in real space using phenix.real_ 
space _refine in PHENIX v.1.16°°. MolProbity 4.5 was used to evaluate 
model geometry® (Extended Data Table 1). FSC curves were calculated 
between the resulting model and the half map used for refinement as 
well as between the resulting model and the other half map for cross- 
validation (Extended Data Fig. 6a). 


Phosphoproteomics experiments 

Receptor samples were denatured, reduced, alkylated and digested 
according to the manufacturers’ protocols (Protifi). In brief, 5-10 pg 
of receptor samplein 0.004% LMNG/0.0004% CHS buffer was diluted 
twofold with buffer containing approximately 10% SDS and 100 mM 
triethylammonium bicarbonate (TEAB, Sigma) pH 7.6. Cysteine resi- 
dues were reduced by the addition of freshly dissolved dithiothreitol 
(Sigma) to 20 mM final concentration and heated to 95 °C for 10 min. 
Samples were cooled to room temperature and iodoacetamide (Sigma) 
was added to 40 mM final concentration. Samples were incubated 
for 30 min with light excluded at 25 °C with gentle agitation using a 
Thermomixer. Phosphoric acid (12%) was added to lower the pH for 
binding to the S-trap column, followed by binding buffer: 90% metha- 
nol, 100 mM TEAB pH7.1. The sample was then added to the S-trap and 
sequentially spun at 4,000¢ until all material was loaded discarding 
the flow-through. The column was washed three times with binding 
buffer, as above. Freshly reconstituted Trypsin/LysC (Promega) was 
diluted into SO mM TEAB pH 8 supplemented with 0.02% protease 
max (Promega) and added to the S-trap (Protifi). The column was 
placed inside a clean microcentrifuge tube and incubated at 47 °Cina 


Thermomixer, without agitation, for one hour. After digestion peptides 
were sequentially eluted by spinning through 40 pl of SO mM TEAB pH 
8,40 pl of 0.1% formic acid and 35 pl of 50% aqueous acetonitrile with 
0.2% formic acid. The resulting solution was combined and evaporated 
with aSpeedVac overnight. 

Dried peptides were reconstituted in 0.1% formic acid and injected 
onto a reverse phase C18 lonOpticks 25 cm Aurora column using a 
nanoElute (Bruker Daltonics). The target on-column load was 200 
ng total peptide per injection. The flow rate was 400 nl min™. Mobile 
phase A was 0.1% formic acid, 2% acetonitrile and 97.9% water; mobile 
phase B was 0.1% formic acid and 99.9% acetonitrile. The gradient was 
linear from 4% B to 36% B over 30 min. The mass spectrometer was a 
timsTOF Pro (Bruker Daltonics) set to acquire data in PASEF (paral- 
lel accumulation serial fragmentation) mode. The TIMS (trapped ion 
mobility spectrometry) ion accumulation time was set to 100 ms and 
precursors were pushed at 2.7 ms. The dynamic exclusion settings were 
set to exclude the same precursor masses for 0.4 min where charge 
states of O-5 were allowed. For each sample three technical replicates 
were run and analysed independently. 

Raw .d files from the timsTOF Pro were searched with Byonic v3.4 
(Protein Metrics) using a custom FASTA database, which included 
the sequences of the human NTSRI1 and human cysteine-free Barr1. 
The enzyme was trypsin (KR) and was set to fully specific, allowing 
for two missed cleavages and constrained to a false-discovery rate 
of less than 1%. The precursor mass tolerance was set to 20 ppm and 
the fragment ion tolerance at 40 ppm. A maximum of six common 
modifications and one rare modification were allowed per peptide, 
and the following modifications were included in the search: C carba- 
midomethyl (+57.021464) as a fixed modification; N-terminal acetyl 
(+42.010565) as rare 1; Nand Q deamidated (+0.984016), as rare 1;M 
oxidation, (+15.994915) as common 1, ST phosphorylation (+79.966331) 
as common 4; Y phosphorylation (+79.966331]) as rare 1; C palmitoyl 
(+238.229666) as rare 1. The resulting Byonic results were imported 
into Byologic v.3.4-55 for quantitative analysis. 


Lipidomics analysis of PtdIns(4,5)P, 

Measurements of diC8-PtdIns(4,5)P, were performed using aq-Exactive 
FT-mass spectrometer (Thermo) equipped with Vanquish Split Sam- 
pler FT uHPLC. Samples were diluted in 50% aqueous methanol and 
injected ontoa Dionex Acclaim 120 C8 column (5 um particle size, 120 A 
pore diameter, 2.1 mminternal diameter, 50 mm length). The column 
compartment was maintained at 29 °C. The flow rate was 0.3 ml min? 
Mobile phase A was 1:1 (v/v%) MeOH/water, 5 mM ammonium acetate; 
mobile phase B was pure acetone. Eachrun was five minutes; the gradi- 
ent used was as follows: 0-0.5 min 0% B, 0.5 to 2 min linear gradient 
from 0 to 50% B, 2 to 3.5 min hold at 50% B, 3.5 to 3.6 min from 50% B 
to 0% Band 3.6 to 5 min at 0% B to re-equilibrate the column. The first 
minute of the run was diverted to waste and minutes 1-4.8 were sent 
to the mass spectrometer for analysis. Detection on the q-Exactive 
was performed in negative mode between 200-2,000 m/z, using an 
acquisition target of 3 x 10°, maximum injection time of 100 ms at a 
resolution of 70,000 for MS and 17,500 for MS/MS (tandem MS) data. 
For MS/MS analysis, dynamic exclusion was set to 10s and a global 
inclusion list was used to target the 745.2 m/z species. Automatic gain 
control target was set to1 x 10°, maximum injection time was 100 ms for 
MS1and 50 ms for MS2. Astepped collision energy from 25 eV to35eV 
(normalized) and an isolation window of 4 m/z was used for higher- 
energy collision-induced dissociation. 


Crosslinking mass spectrometry 

Excised SDS-PAGE slices corresponding to protein or crosslinked pro- 
teins of interest were destained and diced to 1-mm pieces before reduc- 
tion with 5 mM dithiothreitol for 30 minat 55 °C, followed by alkylation 
with 10 mM acrylamide and finally digestion with trypsin/LysC (Pro- 
mega). Digestion was performed at an estimated enzyme:substrate 


ratio of 1:100 overnight at 37 °C. After proteolysis, the reaction was 
quenched with 1% formic acid. Peptides were extracted and dried using 
a SpeedVac before dissolving in 12 pl of reconstitution buffer (2% ace- 
tonitrile with 0.1% formic acid); 3 pl of this solution was injected onto 
the LC-MS. Ina typical mass spectrometry experiment, an Orbitrap 
Fusion Tribrid mass spectrometer (Thermo Scientific) with liquid 
chromatography using a NanoAcquity UPLC (Waters Corporation) 
was used ata flow rate of 450 nl min”. Reverse-phase chromatographic 
separations consisted of mobile phase A (0.2% formic acid in water) and 
mobile phase B (0.2% formic acid in acetonitrile). The stationary phase 
consisted of fused silica columns—pulled and packed in-house—with 
an internal diameter of 100 pm packed with Magic 1.8-um particle size, 
120 Apore diameter, UChrom C18 (nanoLCMS Solutions) at alength 
of around 25 cm. Peptides were directly injected onto the analytical 
column using a gradient (2—45% B, followed by a high-B wash) of length 
80 min. The mass spectrometer was operated in a data-dependent 
fashion using either collision-induced decay (CID) fragmentation or 
higher-energy collisional dissociation (HCD)/electron-transfer dis- 
sociation (ETD) decision tree fragmentation for MS/MS spectra. For 
CID and ETD, MS/MS spectra were detected inthe ion trap and for HCD, 
MS/MS spectra were detected in the Orbitrap. For data analysis, .RAW 
data files were processed using Byonic v3.2.0 (Protein Metrics) toiden- 
tify peptides from a limited database of protein target sequences (as 
described inthesection‘Phosphoproteomics experiments’). Peptides 
were assumed tobe fully specific, allowing for common modifications 
(for example, oxidation of methionine), allowing for up to two missed 
cleavages and were constrained to a false-discovery rate of less than 2%. 
Mass accuracies for precursor and peptide fragments detected inthe 
Orbitrap were held within 12 ppm, with 0.4-Da mass accuracies in the 
iontrap. Potential crosslinked peptides were validated using Byologic 
v.3.4-55 (Protein Metrics). Search results were evaluated in Byologic and 
evaluated relative to the structural model of the NTSR1-Barrl complex. 


Fluorescence anisotropy measurements 

BODIPY TMR phosphatidylinositol 4,5-bisphosphate (Echelon Bio- 
sciences) (henceforth BODIPY-PtdIns(4,5)P,) was dissolved to a stock 
concentration of 1mMin5O mMHEPES pH 7.4 and used ata final concen- 
tration of 4 nM in the assay. For the arrestin measurements, a twofold 
dilution series of was made froma 311 uM stock of Barr1(ACT), yielding 
fourteen samples with final concentrations ranging from 150 pM to 
0.02 uM. For the NTSR1 measurements, a twofold dilution series was 
made from a 140 uM stock of NTSRI1, yielding fourteen samples with 
final concentrations ranging from 67 LM to 0.01 pM. For each, acontrol 
sample containing buffer only was included to measure the free anisot- 
ropy of BODIPY-PtdIns(4,5)P,. After mixing the BODIPY-PtdIns(4,5) 
P, with the receptor, arrestin or buffer, samples were incubated for 1h 
at room temperature before measurements. Samples were measured 
in five 20-1 (arrestin) or three 10-pl (receptor) replicates ina384-well 
plate ona Tecan Infinite M1000 (Tecan Life Sciences), using an excita- 
tion wavelength of 530 nm, an emission wavelength of 573 nm and 
bandwidths of 5 nm. The obtained data was fit using ‘One Site Total’ 
nonlinear regression using GraphPad Prism 8.3.0. 


NanoBiT Parrl1 recruitment assay 

Recruitment of Barrl to NTSR1 was measured by aNanoBiT PPI assay”. 
Human full-length Barrl was N-terminally fused to a large fragment 
(LgBiT; forming Lg-Barr1) of the NanoBiT luciferase with a 15-amino- 
acid flexible linker (GGSGGGGSGGSSSGG). N-terminally Flag-epitope 
(DYKDDDDK)-tagged human NTSRI was C-terminally fused toa small 
fragment (SmBiT; forming NTSR1-Sm) with the 15-amino-acid flexible 
linker. The Lg-Barrl and the NTSR1-Sm constructs were inserted into 
a pCAGGS expression plasmid vector. HEK293A cells (Thermo Fisher 
Scientific, confirmed mycoplasma negative by manufacturer) were 
seeded in a 6-well culture plate at a concentration of 2 x 10° cells per 
ml (2 ml per well in DMEM (Nissui Pharmaceutical) supplemented with 
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10% fetal bovine serum (Gibco), glutamine, penicillin and streptomy- 
cin) 1 day before transfection. Transfection solution was prepared by 
combining 4 pl (per well in a 6-well plate, hereafter) of polyethylenimine 
solution (Polysciences; 1mg mI) anda plasmid mixture consisting of 
100ngLg-Barrland500 ng NTSR1-Sm in 200 ul of Opti- MEM (Thermo 
Fisher Scientific). After incubation for 1 day, transfected cells were 
collected with 0.5 mM EDTA-containing Dulbecco’s PBS, centrifuged 
and suspended in2 ml of HBSS containing 0.01% bovine serum albumin 
(BSA; fatty-acid-free grade; SERVA) and 5 mM HEPES (pH 7.4) (assay 
buffer). The cell suspension was dispensed in a white 96-well plate at 
a volume of 80 pl per well and loaded with 20 ul of 50 uM coelentera- 
zine (Carbosynth) diluted in the assay buffer. After 2-h incubation at 
room temperature, the plate was measured for baseline luminescence 
(Spectramax L, Molecular Devices) and 20 pl of 6x neurotensin (Peptide 
Institute; final concentrations ranging from 0.1 nM to 10 pM) diluted 
in the assay buffer or the assay buffer alone (vehicle) were manually 
added. The plate was read for 15 min with an interval of 20 s at room 
temperature. Luminescence counts were normalized to the initial 
count and arrestin association kinetics were calculated by fitting the 
normalized luminescence data to a one-phase association model built 
in Prism 8 software (GraphPad Prism). In cases in which luminescent 
kinetics were bell-shaped, data points over increasing luminescence 
were used for the fitting. Association speed was calculated bya formula 
of (plateau — Y,)*K in which ‘plateau’ and Y, represent saturated and 
initial luminescent counts, respectively, and K denotes arate constant 
inunits that are the reciprocal of time (min). The resulting association 
speed data were fitted to a four-parameter sigmoidal concentration- 
response curve, from which pEC,, values (negative logarithmic values 
of EC,, values, in which EC,, is the half-maximal effective concentration) 
and £,,,, values (‘top’ —‘bottom’) were used to calculate the mean and 
standard deviation. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 

The cryo-EM density map for the NTSR1-Barrl(ACT) complex has been 
deposited inthe Electron Microscopy Data Bank under accession code 
EMD-20836. The coordinates for the model of NTSR1-Barrl(ACT) have 
been deposited in the PDB under accession number 6UP7. Proteom- 
ics data have been deposited in the ProteomeXchange Consortium 
via the PRIDE partner repository and can be accessed at https://doi. 
org/10.6019/PXD016224. The data supporting the findings of this 
study are available as Supplementary Information (Supplementary 
Tables 1-39 and Supplementary Figs. 2-5). All other data are available 
upon request to the corresponding authors. 
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Extended Data Fig. 1| Screening for GPCR-Barrl complexes. A fluorescence 
size-exclusion chromatography (FSEC) approach was used to identify GPCRs 
that display strong coupling to Barrlin vitro. a, Overview of the screening 
strategy. Small-scale transfection and purification of candidate GFP-tagged 
GPCRs was followed by complexing with activated Barrl, denoted by Barrl*. 
Barrl was activated in situ by mixing with Fab30 and V2Rpp (Methods) before 
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being added (in excess) to the agonist-bound receptor. Complex formation was 
assessed by FSEC. b, Representative FSEC traces. The trace of NTSR1 + Barrl* 
(left) is notable in that it shows a shoulder peak, whichindicates substantial 
complex formation, whereas the trace for B, adrenergic receptor (B,AR, right) + 
Barrl* shows little or no complex formation. Experiments were performed 
independently twice, with similar results. 
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Extended Data Fig. 2| Phosphorylation of NTSR1is crucial for the coupling 
of Barrl.a, The phosphorylation state of NTSR1 was assessed usingion- 
exchange chromatography. Chromatograms of aliquots taken at various time 
points (7, shown in minutes) are overlaid, and the relative amounts of 
unphosphorylated and phosphorylated species were measured using an 
optimized stepped elution profile. This experiment was performed once. 

b, Graphical representation of normalized relative ratio of phosphorylated and 
unphosphorylated receptor from data shown ina.c, d, FSEC chromatogram for 
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screening Barrl constructs for forming a stable complex with either 
unphosphorylated (c) or GRK5-phosphorylated (d) NTSRI1. Elution volumes for 
free receptor, free Barrl and the receptor-Barrl complex are shown. The 
unphosphorylated receptor does not form acomplex with any of the Barrl 
constructs tested, whereas the phosphorylated receptor couples moderately 
to full-length Barrl and strongly to the pre-activated Barrl(ACT) construct. 
Experiments were performed independently twice, with similar results. 
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Extended Data Fig. 3 | See next page for caption. 
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Extended Data Fig. 3 | Overview of sample preparation and representative 
cryo-EM preparation. a, Workflow for the assembly of the NTSR1-Barrl(ACT) 
complex. Receptor, GRKS and Barrl(ACT) were purified separately. 
Phosphorylation of NTSR1 by GRKS in the presence of diC8-PtdIns(4,5)P, was 
monitored by analytical ion-exchange chromatography (shown in inset as 
gradient elution froma representative preparation; a stepped elutionion- 
exchange chromatography run is shown in Extended Data Fig. 2a), followed by 
M1 affinity purification. The phosphorylated receptor was combined with 
Barrl(ACT) and additional diC8-PtdIns(4,5)P, and separated by size-exclusion 
chromatography (SEC) toisolate the NTSR1-farrl(ACT) complex. The complex 
was labelled with sulfo-LC-SDA and irradiated with UV light. After asecond 
round of SEC to re-isolate the complex, the sample was concentrated and used 
for cryo-EM. b, Representative sample preparation: phosphorylated NTSR1 


was mixed with Barrl(ACT) and purified by SEC (blue curve, SEC 1). Complex 
fractions were combined and treated with crosslinker, quenched, then UV- 
irradiated. Lanes 2 and 3 of the SDS-PAGE gel show pooled samples before and 
after crosslinking and UV irradiation; about 25% of the sample is crosslinked 
(based on densitometry, top two bands relative to total). The crosslinked 
sample was re-run on SEC (red curve, SEC2) and SDS-PAGE of peak fractions is 
shown. Peak fractions were combined, concentrated and used for cryo-EM. 
Representative negative-stain electron microscopy images of the NTSRI1- 
Barrl(ACT) complex show similar homogeneity, pre- and post-crosslinking. 
Enlarged views of rotated representative particles (denoted by yellow and blue 
circles) are shown. Four sample preparations, which all gave similar results by 
SEC, SDS-PAGE and negative-stain analysis, were used to generate the cryo-EM 
dataset. 
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Extended Data Fig. 4| See next page for caption. 
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Extended Data Fig. 4 | Mass spectrometry analysis of crosslinks inthe 
NTSRI1-Barr1(ACT) complex. a, Lysine residues modified within the NTSRI- 
Barrl(ACT) complex after treatment with sulfo-LC-SDA wereidentified by mass 
spectrometry. Although mass spectrometry data were collected for each band 
observed by SDS-PAGE separately (Extended Data Fig. 3b), data were 
combined for analysis as the overall sample was imaged by cryo-EM. For each 
lysine residue in the complex the degree of modification was approximated as 
the percentage of times a lysine was found to be modified relative to the total 
number of times that lysine is observed across all peptides. The Ca for lysine 
residues that were observed to be modified less than 30% of the time are shown 
as grey spheres, whereas those found to be modified more than 30% of the time 
areshownas green spheres. Atabulated analysis is presented in 
Supplementary Tables 1-7. b, Of the modified lysine residues, several mainly 
formed dead-end crosslinks (reaction of the activated diazirene with water); 


those with more than 30% of the lysine residues resulting in dead-end 
crosslinks are shown as maroon spheres (Ca).c, Only one intermolecular 
crosslink is observed that is consistent with the structural model obtained 
from cryo-EM. This crosslink was localized to ICL1(K91) anda B-strand (shown 
in green) in the arrestin C-lobe.d, Several intramolecular crosslinks within ICL2 
and the adjacent helices are observed. Reactive lysine residues are shownas 
greenspheres (for Ca), and crosslinks were localized to the region shown in 
green.e, Multiple intramolecular crosslinks within arrestin were observed. 
Each reactive lysine residue (shown asa Ca sphere) that forms an intrapeptide 
crosslink, or an intramolecular (within arrestin) crosslink, is coloured to match 
the regionin which the observed crosslink could be localized. Each band was 
analysed once by mass spectrometry froma representative cryo-EM 
preparation. 
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Extended Data Fig. 5 | Cryo-EM workflow. Flow chart of cryo-EM data analysis. Global resolution was determined by FSC witha cut-off of 0.143. The mask used for 
sharpening was also used for global resolution determination. 
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Extended Data Fig. 6| Cryo-EM map resolution and model validation, with 
multi-body analysis revealing structural heterogeneity within the NTSR1- 
Barr1(ACT) complex. a, Cross validation between the map and the model. The 
model was refined against one half map after displacement of atoms by 0.2A, 
and FSC curves were calculated between this model and the working half1map 
(red), the free half2 map (green) and the final cryo-EM map (full dataset, black) 
by Mtriage implemented in PHENIX. b, Local resolution of the final 4.2 Amap 
was estimated by Bsoft.c, Euler angle distribution of the particle set used inthe 
final map. d, Representative sections of the model with accompanying regions 
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of density from the electron microscopy map. e, Masks used for multi-body 
refinement. f, The contributions of each of the twelve eigenvectors (numbered 
along thex-axis) to the variance of the overall final map. g, Maps corresponding 
tothe second and ninthcomponents of the first eigenvector (f) are aligned, 
showing swing-like motion of Barrl(ACT) with respect to NTSR1.h, Maps 
corresponding to first and tenth components of the second eigenvector (f) are 
superimposed, indicating the tilt-like motion of Barrl(ACT) with respect to 
NTSRI. 
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Extended Data Fig. 7 | Arrestin orientation and interdomain twist. a, 
Additional views of Fig. 1c, showing an overlay between NTSR1-Barrl(ACT) and 
Rho-Arrl: cytoplasmic (intracellular) view (left), side view (middle) and 
extracellular view (right). b, Compared with the inactive structure, Barrl(ACT) 
in complex with NTSR1 displays an additional interdomain twist between its 
N-lobeand its C-lobe, whichis a hallmark of activation. Comparison of the 
interdomaintwist between NTSR1-farrl(ACT) (blue, present structure) and 
V2Rpp-farrl (orange, PDB: 4JQI) relative to inactive Barrl (wheat, PDB: 1G4M). 
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Asmaller interdomain twist and displacement between respective C-lobes is 
observed for NTSR1-bound Barrl (16°, 3.6 A) compared with V2Rpp-bound Barrl 
(22°, 4.6 A), indicated by arrows drawn to relative scale. Asimilar comparison 
for rhodopsin-bound arrestin-1 (green, PDB: SWOP) and pre-activated p44 
arrestin-1 (purple, PDB: 4J2Q) relative to inactive arrestin-1 (pink, PDB: 1CF1) 
yields interdomain twist and displacement values of 20°, 5.4 Aand 23°, 6.1A, 
respectively. 
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Extended Data Fig. 8 | See next page for caption. 


Extended Data Fig. 8 | Identification of GRK5 phosphorylation sitesin 
humanNTSR1and comparison of phosphate-specific contacts in NTSRI1- 
Barr1(ACT) to thosein Rho-Arrl and V2Rpp-farrl1. a, Snake diagram of 
human NTSRI1 showing identified phosphosites in ICL3 and the receptor C 
terminus. Thetrypsin/LysC digestion sites are shown in light blue. All four 
phosphosites for ICL3 are ina single peptide beginning with QAA at the end of 
TMS and ending at PGRat the beginning of TM6. Phosphosites in the C terminus 
occur on two peptides, one containing S396, RPAFSR, as wellas several missed 
cleavage variants; and one containing S401, S403, S404, T407 and T1413. 
Tabulated fragment ions are presented in Supplementary Tables 8-39 and 
spectra are shown in Supplementary Figs. 2-5. Any post translational 
modifications outside of these two discussed regions are not shown for clarity. 


These experiments were performed independently twice, and analysed by two 
or three technical replicates each time, all with similar results. b, Overlay of 
arrestin structures, in which binding of phosphorylated receptor residues 
contributes to stabilizing the active state of arrestin. Left, equivalent 
phosphoresidues on different receptor C-tails (shown in stick representation 
and coloured accordingly: NTSR1, yellow; V2Rpp, red; rhodopsin, pink) 
contribute to arrestin activation by interacting with a lysine residue on the gate 
loop (K294, shown as Ca spheres), thereby breaking the polar core. Right, 
phosphoresidue pT347 in the V2Rpp interacts with residue K77 at the base of 
the finger loop. A similarly positioned phosphoresidue in the ICL3 of NTSR1 
(pS287) probably interacts with arrestin residues R76/K77 ina similar manner. 
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Extended Data Fig. 9 | Membrane context and curvature for NTSR1- inthe NTSR1-Barrl(ACT) structure could also be relevant to the increased 
Barr1(ACT).a, Comparison of the structures of NTSR1-fBarrl(ACT) and NTSR1- membrane curvature observedin endocytic vesicles. As an example, several 
cG,, for which membrane-embedded regions were computed using the membrane bilayers are drawn (in red) to scale, comparing the degree of 
Orientations of Proteins in Membranes server (https://opm.phar.umich.edu/). curvature for vesicles ranging from 120 nmto50 nmin diameter relativetoa 
The membraneis indicated by grey spheres to showthe extent to which the planar bilayer. 


C-edge would insert into the membrane. b, The extent of arrestin tilt observed 


Extended Data Table 1| Cryo-EM data collection, refinement and validation statistics 


Cryo-EM data collection, refinement and validation statistics 


Data collection and processing 
Magnification 
Voltage (kV) 
Electron exposure (e-/A?) 
Defocus range (1m) 
Pixel size (A) 
Symmetry imposed 
Initial particle images (no.) 
Final particle images (no.) 
Map resolution (A) 

FSC threshold 
Map resolution range (A) 


Refinement 
Initial model used (PDB code) 
Map sharpening B factor (A?) 
Model composition 
Non-hydrogen atoms 
Protein residues 
Ligands 
B factors (A?) 
Protein 
Ligand 
R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 
Validation 
MolProbity score 
Clashscore 
Poor rotamers (%) 
Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 


NTSRI1-farrl(ACT) 
(EMDB-20836) 
(PDB 6UP7) 


130,000 
300 


12,365,655 
254,725 


4AGRV,4JQI 
-164 


4166 
685 
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Data exclusions No data was systematically excluded. MS search results were manually inspected to remove false positive IDs. The process of generating 3D 
maps from cryo-EM particles involves sorting for particles that are damaged, have low signal, or are in minority conformations that are 
unlikely to refine correctly. This is implemented in Relion 2.1 


Replication All attempts at replication of biochemical and signaling assays succeeded. For cryoEM, data was collected over several rounds of data 
collection using multiple samples prepared on different days and with different input materials. NanoBiT G-protein dissociation assay shows 


mean +/- SD from 5 independent experiments performed in duplicate (technical replicates of the biological replicates). 
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could be controlled. 


Blinding No blinding was attempted or needed. There was no group allocation performed in this study. 
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JARGON SHUTS 
READERS OUT 


Non-scientists feel confused by technical 
language — evenifit’s defined. By Chris Woolston 


verly technical language in science 
articles doesn’tjust muddy the waters 
for non-experts — it can alienate 
readers, potentially shutting them 
out from scientific discussion and 
knowledge. That’s the conclusion of a study 
published inthe/ournal of Language and Social 
Psychology’, and it applies to general-interest 
articles just as muchas to scientific papers. 

“When we havea hardtime conceptualizing 
information, we become really scared of it,” 
says lead author Hillary Shulman, acommuni- 
cation researcher at the Ohio State University 
in Columbus. Scientists can create “unneces- 
sary barriers” with words, she says. 

The study involved 650 members of the 
general public who read paragraphs on three 
topics: self-driving cars, robotic surgery and 
3D bio-printing. The paragraphs were either 


laden with jargon terms, such as “remote 
ergonomic console”, or written with words that 
are familiar to most readers, suchas “separate 
control panel”. Shulman and her co-authors 
wrote the texts using jargon gathered from 
articles and websites aimed at all readers, not 
fromscientific journals or technical manuals. 

After reading the passages, the study par- 
ticipants rated their experience ina series of 
questionnaires. Those who read jargon-filled 
paragraphs were more likely to say that they 
had difficulty understanding the language and 
the information. They were also significantly 
more likely to say that they weren't good at 
science, and less likely to say that they would 
seek out information onthe topic in the future. 

Some of the participants who read the 
jargon-heavy text received links to defini- 
tions of technical terms, but that didn’t reduce 
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their frustrations or enhance their feelings of 
understanding. “We found that people didn’t 
use the links,” Shulman says. Instead of trying 
to define technical language when communi- 
cating with non-experts, she says, scientists 
would do better to avoid any such terms. 

Scientists can learn to cut back on their 
use of technical language when talking to 
people who are not researchers, says Ayelet 
Baram-Tsabari, a science-communication 
researcher at the Technion Israel Institute of 
Technology in Haifa. In 2017, she helped to 
develop the De-Jargonizer, an online tool that 
assesses and scores the accessibility of text”. 

Baram-Tsabari also co-authored aJanuary 
study in PLoS ONE? showing that scientists with 
media training can write articles that are just 
as engaging as pieces written by professional 
journalists. “Avoiding jargon is afundamental 
part of that, but it’s not the whole story,” she 
says. To really connect with the public, she 
recommends that scientists tell a story that’s 
relevant to the audience. 

Members of the public aren’t the only ones 
who can be turned off by jargon, Shulman 
says. Students can be, too. “Iteachaclass with 
400 undergrads,” she says. “When you’retrain- 
ing people, you can introduce jargon witha 
little more sensitivity. You’re trying to invite 
them into the environment.” 

Of course, technical words still have an 
important function in science. Shulman’s 
paper is itself loaded with terms such as 
‘metacognition’ and ‘self-schema’. “The irony 
of that is not lost on me,” she says. “When it 
comes to scientific literature, you can’t get 
anything published unless it’s full of jargon. 
Scientists want to speak to other scientists in 
the most precise way possible.” 

Baram-Tsabari says that one of her gradu- 
ate students has gathered anecdotal evidence 
that some female researchers feel especially 
pressured to use heavily technical words and 
phrases. “They say, ‘People don’t take me seri- 
ously because I’m a woman. If |used accessible 
language, it would be bad for my career.” Sim- 
ilarly, Shulman has noticed that early-career 
researchers tend to lean on technical language 
to show that they belong in the community. 
“I see it alot in graduate students,” she says. 


Chris Woolston is a freelance writer in Billings, 
Montana. 
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Work / Careers 


INTERCONTINENTAL 
CULTURE CLASH 


Amove from Germany to Mexico was a crash course in 
acclimatizing to anew research environment. By Deb Raj Aryal 


n 2010, after obtaining my master’s 

degree in Germany, I moved to Mexico to 

pursue my PhDin ecology and sustainable 

development. In both places, far away from 

my home country of Nepal, | worked with 
scientists and other colleagues from different 
cultures. This experience greatly improved my 
interpersonal skills and broadened my view of 
the world of research. 

After Icompleted my PhD programme in 
2015, Istarted a research fellowship at Chiapas 
Autonomous University in Mexico. My work 
focused on the environmental impact of agri- 
culture and livestock productioninthe tropics 
and subtropics. It was part of a programme 
set up by the Mexican National Science and 
Technology Council (CONACYT) to attract 
foreign researchers and to encourage Mexi- 
can scientists based abroad to return to the 
country. The programme ran from June 2014 
to December 2018 and attracted more than 
1,300 early-career researchers. 

Iwas conscious that the arrival of so many 
scientists over four years in a country with 
limited research funding might make for an 
uncomfortable situation for the established 
scientists already working there. CONACYT 
research fellows would be absorbed as staff 
scientists or faculty members into their host 
institutes within ten years, although this was 
not obligatory. As someone who was part of 
this large cohort of researchers, I have some 
advice on howto adapt to anew working envi- 
ronment to the benefit of both yourself and 
your colleagues. 


Be humble 


It is an advantage to be an early-career 
scientist with a lot of energy and enthusiasm 
for carrying out cutting-edge research with 
the latest tools and methods — but it’s impor- 
tant to not come across as arrogant. An over- 
confident or pompous attitude can result in 
colleagues not wanting to cooperate or col- 
laborate with you. At the beginning of their 
careers, researchers are most successful when 
they are humble and valuethework of others, 
in my view. 


Respect differences 


Differences in work styles or cultures can 
feel more significant if you come from a 
different continent. Generally, new postdocs 
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or principal investigators at an institute 
should not aim to change the work culture 
of colleagues who have been at that place 
for years. Unless there is unfair treatment 
or harmful behaviour, try instead to adapt 
tothe newculture. There could be moments 
in which you feel uncomfortable with the 
existing dynamics and working styles, but 


“It’s okay that you 
don’tknowthe 
local norms and 
regulations at first.” 


it’s important to accept that everyone has 
their own way of working. Constant and open 
communication regarding your worries and 
anxieties can help to create a shared sense of 
well-being in cross-cultural research labs. 


Handle questions calmly 

Scientists with many years of experience in 
your field might know more than you do. 
If you are asked a question that you feel is 
hostile, don’t take it as a personal attack on 
the integrity of your work. Instead, thank 
the researcher who asked it, but be honest 
and say that you’re still learning. As an 
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early-career researcher, you are not expected 
to knoweverything. If you find yourself being 
compared with others, don’ttry to prove that 
you are better. Your research should speak for 
itself. Sometimes, one group of researchers 
might disagree with another on howtotackle 
a project or share lab equipment, for example. 
Taking the side of the one group by criticizing 
the other won't help scientists who have just 
started ajob at the institute. 


Believe in yourself 

Have faith in your capability to adjust, to 
work well with others and to deliver onthe 
research responsibilities you are assigned. It’s 
okay that you don’t know the local norms and 
regulations at first. Use common sense and 
think about how you might want to be treated. 
If a question comes up that you don’t know 
the answer to, ask someone; many people are 
willing to help newcomers. It is important to 
havea strong and lasting motivation for your 
career in science. It’s fine if your first attempt 
at your research project doesn’t succeed — but 
you must persevere when you fail. 


Deb Raj Aryal is a CONACYT research fellow 
at the Autonomous University of Chiapas in 
Mexico. 

e-mail: drajar@conacyt.mx 
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study how plants adapt to some of the 
most extreme conditions on Earth. My 
particular focus is the South African 
‘resurrection’ plant (Myrothamnus 
flabellifolia), which can survive for 
years ina completely desiccated state. 
Understanding what makes these plants so 
resilient could help us to develop crops that 
will survive drought. 

l often travel to the Buffelskloof Nature 
Reserve, in the Mpumalanga province of 
South Africa, to measure and take samples 
of M. flabellifolia that grow there, ona 
100-metre quartzite cliff. In this picture, 
where I’m about about to abseil down it, I’m 
recording the plants’ height, sex, number 
of flowers, hydration status and soil depth. 
laim to understand how their structure, life 
history and drought tolerance change over 
time, and with distance up the cliff face. 

I'd been rock climbing for more than a 
decade before I began researching these 
plants. I never had any formal training in 
abseiling, but friends and mentors taught me 
the basics, and I learnt through experience. 
I’ve broken both ankles and once fell 
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15 metres before the rope caught me. 
Atsome point, I realized that I could use 
my abseiling skills to reach rarely studied 
plant communities in extreme habitats. 
I’ve found that the flowers of M. flabellifolia 
are also desiccation-tolerant and that the 
plant dries at different rates on the cliff, 
depending on light exposure and on the 
depth and water-holding capacity of the soil. 
My field site is 1,700 kilometres from my 
laboratory. It’s atwo-hour flight from my lab 
to the closest airport, inJohannesburg. Then 
Ihave to drive for 4.5 hours. In daylight, the 
rock surface at the site is at least 40 °C, and 
tools will burn your hand — and! can’t always 
get the equipment and supplies I need. 
Last November, I had to drive about 
40 kilometres to get dry ice to freeze some 
samples. I’d forgotten my wallet, but the 
shopkeeper said I could pay next time. That 
was incredibly heart-warming. 


Rose A. Marks is a postdoctoral researcher in 
molecular and cell biology at the University 
of Cape Town in South Africa. Interview by 
Abdullahi Tsanni. 


